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Preface 


In this monograph we present univariate and multivariate probabilistic inequalities 
regarding basic probabilistic entities like expectation, variance, moment generat- 
ing function and covariance. These are built on recent classical form real analy- 
sis inequalities also given here in full details. This treatise relies on author’s last 
twenty one years related research work, more precisely see [18]-[90], and it is a 
natural outgrowth of it. Chapters are self-contained and several advanced courses 
can be taught out of this book. Extensive background and motivations are given 
per chapter. A very extensive list of references is given at the end. The topics 
covered are very diverse. Initially we present probabilistic Ostrowski type inequal- 
ities, other various related ones, and Grothendieck type probabilistic inequalities. 
A great bulk of the book is about Information theory inequalities, regarding the 
Csiszar’s f-Divergence between probability measures. Another great bulk of the 
book is regarding applications in various directions of Geometry Moment Theory, 
in particular to estimate the rate of weak convergence of probability measures to the 
unit measure, to maximize profits in the stock market, to estimating the difference 
of integral means, and applications to political science in electorial systems. Also 
we develop Griiss type and Chebyshev-Griiss type inequalities for Stieltjes integrals 
and show their applications to probability. Our results are optimal or close to op- 
timal. We end with important real analysis methods with potential applications to 
stochastic. The exposed theory is destined to find applications to all applied sci- 
ences and related subjects, furthermore has its own theoretical merit and interest 
from the point of view of Inequalities in Pure Mathematics. As such is suitable for 
researchers, graduate students, and seminars of the above subjects, also to be in all 
science libraries. 

The final preparation of book took place during 2008 in Memphis, Tennessee, 
USA. 

I would like to thank my family for their dedication and love to me, which was 
the strongest support during the writing of this monograph. 
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Tam also indebted and thankful to Rodica Gal and Razvan Mezei for their heroic 
and great typing preparation of the manuscript in a very short time. 


George A. Anastassiou 

Department of Mathematical Sciences 
The University of Memphis, TN 
U.S.A. 

January 1, 2009 
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Chapter 1 


Introduction 


This monograph is about probabilistic inequalities from the theoretical point of 
view, however with lots of implications to applied problems. Many examples are 
presented. Many inequalities are proved to be sharp and attained. In general the 
presented estimates are very tight. 

This monograph is a rare of its kind and all the results presented here appear 
for the first time in a book format. It is a natural outgrowth of the author’s related 
research work, more precisely see [18]-[90], over the last twenty one years. 

Several types of probabilistic inequalities are discussed and the monograph is 
very diverse regarding the research on the topic. 

In Chapters 2-4 are discussed univariate and multivariate Ostrowski type and 
other general probabilistic inequalities, involving the expectation, variance, moment 
generating function, and covariance of the engaged random variables. These are 
applications of presented general mathematical analysis classical inequalities. 

In Chapter 5 we give results related to the probabilistic version of Grothendieck 
inequality. These inequalities regard the positive bilinear forms. 

Next, in Chapters 6-15 we study and present results regarding the Csiszar’s 
f-Divergence in all possible directions. The Csiszar’s discrimination is the most 
essential and general measure for the comparison between two probability mea- 
sures. Many other divergences are special cases of Csiszar’s f-divergence which is 
an essential component of information theory. 

The probabilistic results derive as applications of deep mathematical analysis 
methods. Many Csiszar’s f-divergence results are applications of developed here 
Ostrowski, Griiss and integral means inequalities. Plenty of examples are given. 

In Chapter 16 we present the basic moment theory, especially we give the ge- 
ometric moment theory method. This basically follows [229]. It has to do with 
finding optimal lower and upper bounds to probabilistic integrals subject to pre- 
scribed moment conditions. 

So in Chapters 17-24 we apply the above described moment methods in vari- 
ous studies. In Chapters 17-19 we deal with optimal lower and upper bounds to 
the expected convex combinations of the Adams and Jefferson rounding rules of a 
nonnegative random variable subject to given moment conditions. This theory has 
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applications to political science. 

In Chapters 20-22, we estimate and find the Prokhorov radius of a family of 
distributions close to Dirac measure at a point, subject to some basic moment con- 
straints. The last expresses the rate of weak convergence of a sequence of probability 
measures to the unit measure at a point. 

In Chapter 23 we apply the geometric moment theory methods to optimal port- 
folio management regarding maximizing profits from stocks and bonds and we give 
examples. The developed theory provides more general optimization formulae with 
applications to stock market. 

In Chapter 24 we find, among others, estimates for the differences of general 
integral means subject to a simple moment condition. 

The moment theory part of the monograph relies on [28], [23], [89], [85], [88], 
[90], [86], [44] and [48]. 

In Chapter 25 we develop Grtiss type inequalities related to Stieltjes integral 
with applications to expectations. 

In Chapter 26 we present Chebyshev-Grtiss type and difference of integral means 
inequalities for the Stieltjes integral with applications to expectation and variance. 

Chapters 27-28 work as an Appendix of this book. In Chapter 27 we discuss 
new expansion formula, while in last Chapter 28 we study the integration by parts 
on the multivariate domain. 

Both final Chapters have potential important applications to probability. 

The writing of the monograph was made to help the reader the most. The 
chapters are self-contained and several courses can be taught out of this book. All 
background needed to understand each chapter is usually found there. Also are 
given, per chapter, strong motivations and inspirations to write it. 

We finish with a rich list of 349 related references. The exposed results are 
expected to find applications in most applied fields like: probability, statistics, 
physics, economics, engineering and information theory, etc. 


Chapter 2 


Basic Stochastic Ostrowski Inequalities 


Very general univariate and multivariate stochastic Ostrowski type inequalities are 
given, involving || - ||. and ||- ||p,, p > 1 norms of probability density functions. 
Some of these inequalities provide pointwise estimates to the error of probability 
distribution function from the expectation of some simple function of the involved 
random variable. Other inequalities give upper bounds for the expectation and 
variance of a random variable. All are presented over finite domains. At the end 
of chapter are presented applications, in particular for the Beta random variable. 
This treatment relies on [31]. 


2.1 Introduction 


In 1938, A. Ostrowski [277] proved a very important integral inequality that made 
the foundation and motivation of a lot of further research in that direction. Os- 
trowski inequalities have a lot of applications, among others, to probability and 
numerical analysis. This chapter presents a group of new general probabilistic Os- 
trowski type inequalities in the univariate and multivariate cases over finite domains. 
For related work see §.S. Dragomir et al. ({174]). We mention the main result from 
that article. 


Theorem 2.1. ([{174]) Let X be a random variable with the probability density 
function f: [a,b] C R — Ry and with cumulative distribution function F(a) = 
Pr(X <2). If f € L,[a,b], p > 1, then we have the inequality: 


Pox <a)- (A) 
< (4) Isto «) \(e)" eg 
< (4) \Ifllp(b—a)!/%, for all x € [a,b], where - . =1, (2.1) 


E stands for expectation. 
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Later Brneti¢ and Peéari¢ improved Theorem 2.1, see [117]. This is another 
related work here. We would like to mention 


Theorem 2.2 ({117]). Let the assumptions of Theorem 2.1 be fulfilled. Then 


sere) (See 


b-—a (q+ 1)(b— a)4 : lf ll. (2.2) 


Pr(X <2) - ( 


for all x € {a, bj. 


2.2 One Dimensional Results 


We mention 


Lemma 2.3. Let X be a random variable taking values in [a,b] C R, and F its 
probability distribution function. Let f be the probability density function of X. Let 
also g: [a,b] > R be a function of bounded variation, such that g(a) 4 g(b). Let 
x € [a,b]. We define 


g(t) — g(a) 
Madar Sa eS 
P(g(x), g(t)) = ee (2.3) 
g —g 
O-a - = 
Then 
1 b b 
Fe) = aay i F(#)dg(t) + | P(g(2),g(t))F(Bdt. (24) 


Proof. Here notice that F’ = f and apply Proposition 1 of [32]. 


Remark 2.4 (on Lemma 2.3). i) We see that the expectation 
b 


b 
E(9(X)) = / a(t)dF(t) = g(b) - / F(t)dg(t), (2.5) 


a 
proved by using integration by parts. 
ii) By (2.4) we obtain 
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And we have proved that 


_ (a) = BX) — f’ pre, 
te) - (OA) = f rrate). acorn (2.6) 


Now we are ready to present 
Theorem 2.5. Let X be a random variable taking values in [a,b] C R, and F its 
probability distribution function. Let f be the probability density function of X. Let 


also g: [a,b] > R be a function of bounded variation, such that g(a) # g(b). Let 
x € [a,b], and the kernel P defined as in (2.3). Then we get 


| Fa) hak 


g(b) — g(a) 
(a IP(g aoa) ||Flloo, if f € Loo([a, 8), 
ae |P(9(x), 9(t))I, (2.7) 
1/q 
( [a |P(g ata) If, ff € Lp([a,d)), pa >1: 5+ 5=1. 


Proof. From (2.6) we obtain 


Feo (acme Jef Peers es 


The rest are obvious. 


In the following we consider the moment generating function 
Mx(x):= E(e**), x [a, bj; 


and we take g(t) := e*, all t € [a, J. 


Theorem 2.6. Let X be a random variable taking values in [a,b]:0 <a < b. Let 
F be its probability distribution function and f its probability density function of X. 
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Let x € [a,b]. Then 


if f € Loo([a, 4), 


max{(e® — e®), (e® — e®”)} 


(ev? — ev) 7 ' 
‘2 1/q 
lf llp ( at LaYd yp : ab at 4 dt 
Z reer fe er — ee *"\4dt + | (e”? — e*") ; 
if f € Ly € ([a, 4), pq>liste=l, (2.9) 


and in particular, 


3 
II f lle . [eo a) Mie (ee a) 4 ane 
xv 


(er? er) 
1/2 
3 Qerata? Qerdta? ; 
- ae + , ff La((a,0), 
xv av xv 


Proof. Application of Theorem 2.5. 


Another important result comes: 
Theorem 2.7. Let X be a random variable taking values in [a,b] C R and F its 
probability distribution function. Let f be the probability density function of X. Let 
x € [a,b] andmeN. Then 


| F(a) - A 


b™ — qm 
es 9) m+1 __ m+1 _ pm+l 
i =) {i t — Dame pyres — ame oral 
—-a m 


ile) Tf" ve 
2 (4) / oe —amytar+ f (b™ ya : 
if f € Lp([a,6)); pg >1:44+2=1, (2.10) 


and in particular, 


2 mt+1 
(; IE ) {om 0) + (a — 0) 
2m? 


Y Gat D@m+1) 


1/2 
vn gan ; if f € Lo({a, db). 


m 


Proof. Application of Theorem 2.5, when g(t) = t 


An interesting case follows. 
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Theorem 2.8. Let f be the probability density function of a random variable X. 
Here X takes values in [—a,a], a >0. We assume that f is 3-times differentiable 
on [—a, a], and that f” is bounded on |—a,a]. Then it holds 


px?) a (Alles fea + aso) = Co) - 
Ala! f ” 2 em 
S Tyg W6r@) + 67" + PF" Olleo- (2.11) 


Proof. Application of Theorem 4 from [32]. 


The next result is regarding the variance Var(X). 


Theorem 2.9. Let X be a random variable that takes values in [a,b] with proba- 
bility density function f. We assume that f is 3-times differentiable on [a,b]. Let 
EX =. Define 


a seit 
Then 
Van(X) + ((0~ 12400) ~ (w= a)? f(a) (u- SE*) 
+ {0 - WFO) + (ua) fla) + 510 WP) — u—a)?F(ah} 
? feo eo (= - me - =) | 


(3) b pb 
Hae ff tel. 30)l ler. 50)1- Inle2, adsrdsa 


if fe L ({a, 4), P,d> 1:4 a t= 1, 
aes pra (2.12) 


Ip(u; $1)| [p(s1, $2)| + |[p(s2,°)|loodsidse, 
if f® € Ly((a, 8). 


IA 


Proof. Application of Lemma 1([32]), for (t — w)* f(t) in place of f, and z = yp € 
[a, b]. 


2.3. Multidimensional Results 


Background 2.10. Here we consider random variables X; >0,7=1,...,n,n EN, 
taking values in [0, bj], b; > 0. Let F' be the joint probability distribution function of 
(Xy,...,Xn) assumed to be a continuous function. Let f be the probability density 


8 PROBABILISTIC INEQUALITIES 


function. We suppose that os (= f) exists on x?_,[0,b;] and is integrable. 
Clearly, the expectation 


n bi pbo bn 
E (11) = f of Uys On f(X1,02,...,%n)dv1 +++ dey. (2.13) 
4 0 Jo 0 


Consider on x?_,[0,6;] the function 
G(x, sae ,Ln) =1- S> F(b1, 02, soe »bj-1,%5, bj 41, oe On) 
j=l 


(3) 
+ S 5 F(b1, bay «+65 Bin, Bis «+ 485 Bj415 ++ 65 Onde 


fs=1 
VJ 


(3) 


+(-1)° SS FD 5 ois es Mice g tay ees Eh DE i5 «ven Oy) es 
be=1 
i<j<k 
fees (-1)"F(a1,...,2n)- (2.14) 


Above £ counts the (i, 7)’s, while ¢* counts the (i, j,k)’s, etc. One can easily estab- 
lish that 


O"Gn(x1, aes 50x) 


An ee: (2.15) 


We give 


Theorem 2.11. According to the above assumptions and notations we have 
n by bn 
E(][ x =| af Gn(a1,...,%n)dx,-+- dan. (2.16) 
i=1 o 0 


Proof. Integrate by parts G, with respect to x,, taking into account that 
G,(x1, La, ey En—1) bn) =0. 


Then again integrate by parts with respect to variable x,_; the outcome of the 
first integration. Then repeat, successively, the same procedure by integrating by 
parts with respect to variables 7,2, %p_3,...,21. Thus we have recovered formula 
(2.13). That is deriving (2.16). 


Let (%1,..-,2n) € X%%,[0, b;] be fixed. We define the kernels p;: (0, bi]? > R: 


Si, SLE [05-25], 


8, —b;, $4 © (x, dil, ery) 


Pi(Xi, $i) = { 


for alli =1,...,n. 
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Theorem 2.12. Based on the above assumptions and notations we derive that 


Ain = Coe). In (4,8 3) f(s1, 82,-.-,8n)ds1ds2---ds 
[0,04] 


I 


II b; Gn(@1,---;8i,---, Un) ds; 


b5 fis [0,64] Gn(s1,.--,2j,-+-,Sn)ds1--+ds;---dsy 


b; b; 
+ II br (/ Gales tiene cette) 
o Jo 
(€) 
+ 


=e Ce ack (11) Se (2.18) 


The above £ counts all the (i,9)’s: i < j andi,j =1,...,n. Also ds; means that 
ds; is missing. 

Proof. Application of Theorem 2 ([30]) for f = Gp, also taking into account (2.15) 
and (2.16). 


As a related result we give 


Theorem 2.13. Under the notations and assumptions of this Section 2.3, addi- 


tionally we assume that ||f(s1,---,5n)|loo < +oo. Then 

A Il f(s1,--+5 8n)lloo = 2 2 

Bank sae YLT le? + eae (2.19) 
for any (a1,..-,%n) € X},[0, bi], n EN. 


Proof. Based on Theorem 2.12, and Theorem 4 ([30]). 
The Ly analogue follows 


Theorem 2.14. Under the notations and assumptions of this Section 2.3, addi- 
tionally we suppose that f € Ly(xii,[0,bi]), %e., ||fllp < too, where p,q > 1: 
ab < =1. Then 


n 1/q 
82, n| S aos {Ter + ate ai nage] ' (2.20) 
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for any (a@1,...,%n) € X#,(0, bi]. 
Proof. Based on Theorem 2.12, and Theorem 6 ([30]). 


2.4 Applications 


i) On Theorem 2.6: Let X be a random variable uniformly distributed over [a, J, 


0 <a< b. Here the probability density function is 
1 
f®o= roa (2.21) 
—a 
And the probability distribution function is 


wa 


F(a) = P(X <a2)= ; a<a<b. (2.22) 
-—a 
Furthermore the moment generating function is 
Mx(z) ee <a<b (2.23) 
x) = —— a<au<b. : 
* (b—a)x’ = es 


Here || foo = g4z, and ||fIlp = (b—a)>—, p > 1. Finally the left-hand side of (1.9) 
is 


a (2.24) 


bx eof _ ett 
(F = *) Crt. a(b—a 
Then one finds interesting comparisons via inequality (2.9). 


ii) On Theorem 2.7 (see also [105], [107], [117], [174]): Let X be a Beta random 
variable taking values in [0,1] with parameters s > 0, t > 0. It has probability 
density function 

f(0;s,t) = ; (2.25) 


where 


1 
B(s,t) al 711 — r)*" dz, 
0 


the Beta function. We know that E(X) = 34, see [105]. As noted in [174] for 
p>i1,s>1—<, andt>1—< we find that 
1 
. = =— -[B(p(s— 1) +1, p(t- 1) +1)”. 2.2 
If 8.9 = Bray BOCs - 1) + 1-1) +0) (2.26) 


Also from [107] for s,t > 1 we have 


(8 — 1)°-1(t — 1)! 


fC 8: Ollo = BEDisp ts DS (2.27) 
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One can find easily for m € N that 
I] (s+m-— J) 
BOC 2 (2.28) 
I] (s+t+m-—j) 


Here F(x) = P(X < x), P stands for probability. 
So we present 


Theorem 2.15. Let X be a Beta random variable taking values in [0,1] with 
parameters s,t. Let x € [0,1], mE N. Then we obtain 


[[(s+m—j) 


B 


P(X >a) — | == 
Leet) 


( (s—1)*-1(¢ — 1)" = 


1- hy t>1 
BGs, )(s+t— 2)? ial. ar i ea 


max(a#’",1— 2"), when s,t > 0, 


1 y/ mati 1 1/q 
— ss (Ble 1) het 1) 4 1). | 1 Pa 
Bed [B(p(s — 1) +1, p( )+1)] + fi ) | ; 
< 
r= eel 1 
when p,qg>1:-+-—=1, ands,t>1—--, 
, Pp 4 Pp 
3 (BOs 12a 2.2 
Bis.) [B(2s —1, )} (2.29) 
Qemtl 2m? nee 1 
h t>-. 
(= to) ek 


The case of « = 4 is especially interesting. We find 


Theorem 2.16. Let X be a Beta random variable taking values in [0,1] with 
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parameters s,t. Let meEN. Then 


: I] (s+m—j) 
p(x25)- ie 
Il (s+t+m-—j) 
j=l 
(yee!) (Ge=1) 1 
B(s,t)(s +t — 2)s+t-2 m+1 2? 
when s,t > 1, 
1 
bee? when s,t > 0, (2.30) 
1 
1 1 1 . q 
-(B -—1 1,p(t-1 1)|> - | —————_ 1—t™)%dt 
<} Fey Ble D +e +0 ramet + feta” 
1 
when p,q>1:-+-=1, ands,t>1—--, 
pq p ; 
1 1 1 1 2m? 2 
2, ame Re en (mm nr 
Fay Bes - 128-9 tl soot! 


when s,t > =. 
2 


2.5 Addendum 


We present here a detailed proof of Theorem 2.11, when n = 3. 


Theorem 2.17. Let the random vector (X,Y, Z) taking values on tee b;], where 

b; > 0,4 = 1,2,3. Let F be the cumulative distribution function of (X,Y,Z), as- 
3 

sumed to be continuous on TL, [0, bi). Assume that there exists f = so > 0 


a.e, and is integrable on T12_1[0, 4). Then f is a joint probability density function 
of (X,Y, Z). Furthermore we have the representation 


bi ba b3 
E(XYZ) = i i G3(a, y, z)dxdydz, (2.31) 
0 Jo Jo 
where 


G3(z, y, Z) = 1- F(a, b2, b3) _ F(b1, y, 63) ar F(a, y, b3) 


=F (bi, bo, 2) + Eg, bo, 2) no F(b1,y, 2) bo: F(a, Yy, 2) (2.32) 


Proof. For the definition of the multidimensional cumulative distribution see 
[235], p. 97. 
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We observe that 


[ (” ( i Hears] i) dz 
= [ (/" (/° ort las) iv dz 


b3 ba 62 
=| —"_(F(b,y, 2) — F(o,y,2)) dy | dz (2.33) 
0 0 OyOz 
b3 
= Oz [F'(b1, be, z) ae F(0, be, z) — F(by,0,z) + F(0,0, z)] dz 
0 vA 
= F'(b1, bz, b3) — F(0, be, b3) — F'(b1,0, b3) + F(0, 0, 63) — F(b1, b2, 0) 
+ F(0, b2,0) + F(b1,0,0) — F(0,0,0) = F(b1, be, b3) = 1, (2.34) 


proving that f is a probability density function. 


We know that 
bi bo pbs 
E(XYZ) “J f i, xyzf (x,y, z)dudydz. 


We notice that G3(z, y, b3) = 
So me have 
bs 


b3 
CNC CHC et oe | CNC) 
0 


b3 b3 
-- | zd,Gale.y.2)= | xar(a, y, z)dz, (2.35) 
0 0 
where 
OF (61, be, z OF (a, ba, z OF (by, y, z OF (a, y,z 
ay(x,y, 2) = ee) oe Bt)’ OF (hilas®) ar OE) oa ) + OP eyes) ) (2.36) 
Next take 
be 
ion 
0 
f s z) OF(x,b2,z) OF (b1,y,2) . OF (x,y,z) 
= sO — eee eee sO | 
Oz Oz Oz 
OF ( 2 = OF (x, ba, 2) _ OF (bi, y, 2) fh OF (x,y, 2) be 
Oz Oz Oz y 
be ae F(b1,y,z) 0° F (x,y,z) be 
o) 7 7 9 — 9. 
where 


0? F(b1,y, 2) OF (x,y, 2) 


02 (2, Y, Z) = 1 ae (2.38) 
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Finally we see that 


bi 
‘i Q(x, y, z)dx 
0 
= [ O° F (bi, y, 2) = OP F(z, y, 2) d 
ae OyOz OyOz is 


0? F(b1,y, Z) OP F(z, y,z) by i OF (a. 9,2) 
= x | ———— —- ———— ] - 5 - x | -—————_ ] dr 
OyOz OyOz 0 OxOy Oz 


b b 
SOP ues. 


Putting things together we get 


ies (f° ( is Galen 2M] iy) da 


II 
os 
ia 
Lon OS 
os 
a 
aN 
o— 
Ss 
& 
Q 
= 
ae. 3 
8 
S 
R 
~—S" 
Q 
R 
Ne 
Q. 
eS 
SS 
Q 
8 


-[": (= (/[" ssn) iv a 


by bo b3 
= if | i xyz f(x,y, z)dudydz = E(XYZ), (2.40) 
0 Jo Jo 


proving the claim. 
Remark 2.18. Notice that 
0G3(a, y, 2) = OF (01, be, 2) as OF (a, bo, z) 
Oz Oz Oz 
OF (b1,y,z) OF (x,y,z) 
Oz 7 Oz : 
2’G3(x,y,z) OP F(bi,y,z) O° F(a, y,z) 
OyOz -_ OyOz — OyOz 


and 
O° G3(z, y, 2) SF (x,y,z) ies 
dxOyOz  —=<“C«COYO F(a, 2). 
That is giving us 
O°G3(a, Y; z) 


Chapter 3 


Multidimensional Montgomery Identities 
and Ostrowski type Inequalities 


Multidimensional Montgomery type identities are presented involving, among oth- 
ers, integrals of mixed partial derivatives. These lead to interesting multidimen- 
sional Ostrowski type inequalities. The inequalities in their right-hand sides involve 
L,-norms (1 < p < oo) of the engaged mixed partials. An application is given also 
to Probability. This treatment is based on [33]. 


3.1 Introduction 


In 1938, A. Ostrowski [277] proved the following inequality: 


Theorem 3.1. Let f: [a,b] — R be continuous on [a,b] and differentiable on (a,b) 
whose derivative f’: (a,b) — R is bounded on (a, 6), #.e., ||f’|lo = sup |f’(t)| < 
tE (a,b) 
+oo. Then 
ae i lo G=seF 
oe - ee ON oo 
sz f foe F@] s |G+5 FF] -6-olrle, OD 


for any x € [a,b]. The constant + is the best possible. 


Since then there has been a lot of activity about these inequalities with inter- 
esting applications to Probability and Numerical Analysis. 

This chapter has been greatly motivated by the following result of S$. Dragomir 
et al. (2000), see [176] and [99], p. 263, Theorem 5.18. It gives a multidimensional 
Montgomery type identity. 


Theorem 3.2. Let f: [a,b] x [c,d] — R be such that the partial derivatives Of tts) 
Of (t,8) ies) 
tos 


Os ? 


, 


exist and are continuous on [a,b] x [c,d]. Then for all (x,y) € 
(a, b] x [c,d], we have the representation 


few= toa somal ff te orasaes f° foe, PICS) sa 
+f fia q(y,s OM ass f° fp (x, t)q(y, s ana (3.2) 


15 
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where p: [a,b]? — R, q: [c,d]? = R and are given by 
t—a, ifte [a,c] 
t)ys= 
P(z,t) e if t € (x,D] 
and 


~~ Pee. afsele4] 
dus= {oro if 5 € (y, dj. 


In this chapter we give more general multidimensional Montgomery type identi- 
ties and derive related new multidimensional Ostrowski type inequalities involving 
| - lp (1 < p < 00) norms of the engaged mixed partial derivatives. At the end 
we give an application to Probability. That is finding bounds for the difference of 
the expectation of a product of three random variables from an appropriate linear 
combination of values of the related joint probability distribution function. 


3.2 Results 


We present a multidimensional Montgomery type identity which is a generalization 
of Theorem 5.18, p. 263 in [99], see also [176]. We give the representation. 


Theorem 3.3. Let f: x3_, [ai,bi] > R such that the partials Oise) j= 


1,2,3; career j<k, 9,k € {1, 2,3}; Bf lsssta89) exist and are continuous on 


x3_, (ai, di]. Let (1, 22,73) € x2 on bj]. Define the kernels p;: [a;, bi]?  R: 


$—a;, 9; € [a;, 2] 
pil as i) {2 = b;, SEE (85, 0:| 


fori =1,2,3. Then 


f (x1, £2, 23) 


by bo b3 
= mas ‘eA f( $1, $2, §3)ds3ds2ds1 


3 
TL (6: — aj) 


of 3 Be Ps Us Of(s1, 82,83) 4 d d 
We (x5, 8;) 08; 83052481 


j=1 
3 br pba pb 2 
1 2 3 O ; ; 
+ ys (/ ; fn pj (@5,87)Pk(Lk, Sk) - AH eset) isadsyds () 
=1 j 
j<k 


b b 3 
1 2 3 0° f(s1, 82, 83) 
+f i. [ (Toto) “Daas asdsads, (3.3) 


Above € counts (j,k): 7 < k; j,k © {1,2,3}. 
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Proof. Here we use repeatedly the following identity due to Montgomery (see [265], 
p. 565), which can be easily proved by using integration by parts 


1 B i B j 
gu) = sa f ole)de+ af ou2)a' ede, 


where k: [a, 3]? — R is defined by 


= zZ—-a, if z € [a, ul 
k(u, 2) = ome if z € (u, 8], 


and g is absolutely continuous on [a, /]. 
First we see that 


f(21,%2,%3) = Ap + Bo, 


where 
i bt 
Ag := a f(s1,%2, %3)ds1, 
1— a4 
and 
1 by Of (s1 v2 x3) 
Bo := Pail, eh: ar eo a 
0 bai i pi(#1, 81) sy ds, 
Furthermore we have 
f(s1,%2,%3) = Ar + Bi, 
where 
1 be 
A\:= ie f(s1, $2, %3)dse, 
and 
1 be Of (s1 52 x3) 
By := pan aS el Bd 
1 pute [ p2(X2, $2) Aso ds 
Also we find that 
1 bs 
f(s1, 82,23) = eer (81, $2, $3)ds3 
1 ‘i (0, 99) Of (818289) 4 
bea Jas P3(X3, 83 Bas 3: 
Next we put things together, and we derive 
Aj = fs =a) (bs a) Ts =a) rede “#( 81, 82, 83)ds3ds2 


be bs Of (s1 52 53) 
Ob — Me ae) 7a i: p3(#3, $3) Dis ds3ds2 
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And 


Big pba pbs 
Ag = a. pe f (81, 2, 83)ds3dsgds1 
“Ti (bj — a;) 


3 
IT (i — ai) 
Bas pba pba 
ea i [ p3(#3, $3) AA se ssdsadsy 


= 
bi be 
$1, 52,2 
p2(X2, 82) Of(s1 82 SSH iPS) Fi, 
ieee bn A) =n Ose 


Also we obtain 


Of (81, 52,23) pee: ie OF (si, $2183) 4, 


Os bg — a3 O82 


1 bs 0? f (81, 82, 8 
/ pala ee ees e 3) ass. 


bs — a3 083082 


Therefore we get 


by bo b3 
Ag = 3 a Fe ah f (81, $2, $3)ds3dsods1 
it (d; = ai) 
bi be b3 
=, [ [ms p3(x3, 83) 2A ses) 2,83) ds3dsods1 


i= 
bi bo bg 
81, 82,8 
a E i p2(x2, 82) on ae z ACTRESS ee 
82 
bi bo bg 
=). [ [mi p2 (x2, 82 )p3(#3, 83) 


. iy $2, 83) 
083082 


=. 
ice 


cies 


i=l 


ome) 


7 


ds3ds9ds1. 
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Similarly we obtain that 


by bo bs 
Bo = ee i, [om Pi (a1, $1) Ofts1, 83:83) $2) $3) ds3ds9ds,1 
$1 


[ (bs — a5) 


? 


br bz pbs 0? f (81, 89,8 
Bs etn [ [om Pi(21, 1)p3(3, $3) OMe. 82:88) 3) deadendey 
IT ( (b; — aj) 3051 
bi be bs 0? f(s1 52 83) 
> ———"— ds3dsod 
+S [ i [om P1(£1, $1)p2(£2, $2) —3————— Os0% $352.08) 


I] @s — 2%) 


i=l 


by bo bs 
[ i : [ P1(@1, 81)p2(%2, $2)p3(x3, 53) 


(Wee) 


il id 


. ers $2, 583) 


ds3ds2dsj. 
083082081 ree 


We have proved (3.3). 
A generalization of Theorem 3.3 to any n € N follows: 


Theorem 3.4. Let f: x? [ai,b;] —~ R such that the partials 


af oo ‘ BOT 2 

Dag em oalnd = Niece aoa 

: ay 

gk € {1,...,n}; Bee 

: olf Oo" f 
DEG Neots a pinatoae © = ONO Behe cenatoa: 


all exist and are continuous on x'_,[a;, bi], n € N. Above 08% means Ose is missing. 


Let (1,.--,2n) € X%4[ai, bi]. Define the kernels p;: [ai, bi]? — R: 


8, — Qj, $i © [ai, Zi], 


i( Li, Si) = 1 = 1,2,...,0. 3.4 
Ble #4) a 8; € (x, bi], ee ‘ ( ) 
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Then 
1 
f(a, 22; :Ln) = 
(11 ai)) 
1 f(s, $2, $n) dS8nd8n—1 ds, 
xP, [a:,b:] 
: AF(81, 89, ---5$n) 
+ / pj (&j, 83) —— ds, --- ds1 
s( xf [aid] Os; 
2 
OF f(Sis S855 Bq 
+ ys (/ P5185) Pe (ey 84) LPB Sd, ae “) 
f= \YXtealasbe] SkOS; (1) 


lg=1 iz 1[ai,b 
j<k<r 
3 
. O TASS) 9 otal 
08,08,08; (2) 


eee 
pee De (/ pi (1, 81) += pel@e, 80) +“ Pulm; Sn) 
xi y [ac,di] 


f=1 
OP—E (6i, cess BH) sa 
Aasouneo as) 


O” f (s1, eaey Sn) 
+f ig: - (Io Xi, Si ) Oe ae 8 in} 7 (3.5) 


Above ¢; counts (j,k): 9 < k; j,k © {1,2,...,n}, also €2 counts (j,k,r): 7 < 
k <r; j,k,r © {1,2,...,n}, etc. Also pe(ae, se) and dsg means that pe(xe, se) and 
dse are missing, respectively. 


Proof. Similar to Theorem 3.3. 


Next we present some Ostrowski type [277] inequalities. In particular the result 
that follows generalizes Theorem 5.21, p. 266 from [99], also the corresponding result 
from [176]. 


Theorem 3.5. Let f: x3_, [a:,b;] > R as in Theorem 3.3. Then 


bi be b3 
f(@1, 22, £3) — aes | ya 7 ($1, $2, 83)ds3dsqds1 
TI (0: — a:) 


w=1 


3 
< (M13 + Mo,3 + M3,3)/ 1G = «)) (3.6) 


i=1 
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Here we have 


He (bj — ai) | - ee | 


ee 
if 52 (xy (Aiba) Sag 1278, 
=: (| st I] i — aj) )/4 
Mi 3 := < 
(@j—aj) 49484 (05-05) t* ay 1/45 
[ee ae ? 
-¢ O 
if $f € Lp, oe a pj >1, 
7 om , for 7 =1,2,3; 
3 
of b;—a; +b; 
» (||36 gees : )). 


if BE Li (x%jlai,bil) , for j =1,2,3. 


Also we have 


3 3 
orf 
b; — a 
De lee II ( a) 
l=1 i=1 
j<k tk 
3/ (2) 
f Facda; © boo (xiailai, bil) #5 € {1,2,3}; 
J 
3 3 
Of 
»» (=a)! 
f=1 coe Pri I 
j<k iA#j,k 
[eames yagi 
M23 := dkj +1 
esas) 
a | oy 
af im ‘a 1 
if ——— € Lp,; (x1 [ai bi]) ,pey > 1, — + — =1, 
T 55,05; nee | ri }) Pkj Daj Thi 
if j,k € {1,2, 3}; 
b; — a; ay +b; 
> (lan, (5% +f- 5%) 
fk 
b. — 
; k Oi Mian, Ge Oh 
2 2 (0) 
. Of 
of € Ly (x3 {= 114i, bil), for j,k € {1,2 mile 


08,08; 


21 


(3.8) 
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And finally, 


3 

((aj; — aj)? + (b; — 2;)?) 
Pica). garet Te 
08308208, 
F 0° f (81, 82, 83) 3 
if Seeste, © boo (Lalas bil) 
AFF (s1, 82,83) |] 
08308208, 


P123 j=1 
Ls — As q3t1 4 b; —x,;)u123 tt 1/qi23 
se 4 ee tee 
s qi23 + 1 
_, O° f(s1, 82,8 
if 7 fe st) € Lpyog (x34 [a:, bi]) ,pi23 > 1, 
1 


(3.9) 


08 f (si, $2, 53) 
08308208, 


_, Of (81, $2, 83) is 
if ecdeeds, © Et Otaalei, bid) - 


jo 


Inequality (3.6) is valid for any (a1, 22,23) € X#_,[ai, bi], where || - ||p (1 < p < 00) 
are the usual Lp-norms on x3_,[ai, bi] 


Proof. We have by (3.3) that 


by be b3 
f(@1, 22,23) — zee 8 ede f ($1, $2, 83)ds3dsqds1 
I] — aj) 


= 


=e ELE L Lorn 


jot dsadsads:) 

Os; 

3 bi bo pbs 
we (/ ‘ / pj (a5, 85)] > pe(we, Sx)| 

jane 

2 
oe asdsads: 

08,08; @ 

bi be bs 2 : 

O° f (1, $2, 83) 
iA Xj, $4 aerate roma . 

+f A i. (I [pi(x ») | 083082081 er 
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We notice the following (j = 1, 2,3) 


oe ee . |e 
Ip; (2j, 85) | 


ds3dsods, 


by pbs pbs 
(leh a i: i |p; (a7, 8;)|ds3dsods1, 
it 2 ao E Loo (X94 [ai, bil) ; 
1/45 
< malt Ae [ [Cu |p; (aj, 3; pees) ; 
im Bs) ! ety, (x3 i [as,d:]) ,p i Say +o 13 
Is) . ee \p;(a;,8;)|, if con Ely (xe :[a:, bi]) : 


Furthermore we observe that 


bi bo pbs 
/ / / |p; (23; 8;)|ds3dsods, 
a1 ag Jaz 


= (bi — a1)(bj — a;)(b3 — a3) pe 


bj — 4; means (b; — a;) is missing, for 7 = 1,2,3. Also we find 


by bo bs 1/qj 
/ ' : pees) Masvdsss, 
ay a2 a3 


= (by — a1)'/% (B; — a;)'/% (bs — ag)! % 


S _ aj;)utt + (b; — | 1/45 
qj+1 : 


(b; — a;)/% means (b; — a;)'/% is missing, for 7 = 1,2,3. Also 


b; — a; a; +b 
sup |p;(x;, 8j)| = max{zj — aj,bj — aj} = << tee 5 2), 
87; €[a;,b5] 
for 7 = 1,2,3. 
Putting things together we get 
3 by be bs Of S 
1, 52, 53 
Ss? (/ ) i, pj (xj, 85) - ee ds3ds2ds1 
j=l a1 a2 a3 $j 


23 
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Os; 
3 3 
of 1/43 
D2 8s; | []@: - ai) q 
j=l Pj i=1 
< Aj (3.10) 
{@ 0g APE (bg = - i 
af qjt+1 : 
1 1 
if — € Ly, (x2, [a:,b:]) ,p; > 1, —+ — = 1,for j = 1,2,3; 
; as; p; ( il ) ji Dp Jj 
Ss Of (4 1 - SS4)) 
7A Os; 1 2 2, 
ts) 
fF ky (x} #3 [ag bj] ) for 7 =, 2,3. 
J 


Similarly working, next we find that 


Sf PP ws eine oy | 
5 (25, 95)] + [Pe(Te, Sk 8,08; 


iat 


L (é) 


j< 


At 


3 
(bi — ai) 
t=1 (be 087.08; ||, set 
i<k iXzj,k 
4 


a. 


Oy 
ef 


f Loo (X21 (04, bi]) ,&, 9 € {1,2, 3}; 
7 paee = (xzerlas, ), open. ’ 3}; 


3 
b; — a;)1/93 
mal, {i ( ) (3.11a) 


fe t#j,k 
dei +1 

: + 1 ) 

= ) 


IA 


08,08; Pag OES LD SU) 9 PRG ; 
1 1 : 
—+t+—= 1, for Uk € {1, 2, 3}; 
Pkj kj 


if 
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a aie 0? f (81, 82, 8 
Se i [ Ip;(@j, 83)|* |Pe(k, Sk) | OH (s1, 82,83) 1, 82 $3) ds3dsods, 
= ~ As,08; ; 
fk (é) 
Bi Oe e.g Os 
f=1 > (ZS, ( 2 3 a5 
j<k 
< bp — b 
= if ee es ORE (3.110) 
2 2 () 
Of 
if € Ly (x2, [a:, bi]) , for j,k € {1, 2, 3}. 


08,08; 
Finally we obtain that 


ae I 20 aye 
pil 49 84 ~ 083080081 

3 

eae 


ay)” + (bj — @3)*) 


ds3dsods1 


08 f (s1, $2, 83) j=l 
08308208, 5S FF 
0? f (s1 52 $83) 
if =<" € Lae (x7, [ai, di) ; 
, 08308208 : oe bil) 
& f(s1, $2, 83) Il (Grantee + ager 
< 083082081 |I,,., 5 qi23 + 1 (3.12) 
08 f (s1 82 83) 1 1 
if ——————e L #1 lai, bi) , i aes es 
: 0530808 : pias (Gels, bil) Pras 2123 - 1123 
0° f (81, 82, 83) Tl i 
083082081 1 fel 2 ? 2 : 
0° f (81, 82, 83) 
if ey, is bi]) 
: 08308208 : (<i ail, y 


Taking into account (3.10), (3.11a), (3.11b), (3.12) we have completed the proof of 
(3.6). 


A generalization of Theorem 3.5 follows: 


Theorem 3.6. Let f: xt, [ai,b:]) —~ R as in Theorem 3.4, n € N, and 
(@1,.--,2n) € Xty[ai,b;]. Here || - ||p (1 < p < oo) is the usual Lp,-norm on 
xt [a:,b;]. Then 
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Here we have 


Also we have 
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n 


Oo, i=1 
fj 
(xj — a5)? + (bj — 23)? 
2 ’ 
0 n 
if 55, © boo (xiaalai, bi]) 5 = 1,2, n; 
j 
DS 52 [[@- i)" 
a ip \ ist 
wAj 
(a; — aj)4*1 + (bj; — aj)9+ 1/q 
gr ; 
i) 
if so € Lp (Xjiy[as, b:]) ,p > 1, 
ee er ee 
n | 
> (| of (4 Fae a; +b; )) 
me Os; ||, 9 =o 
) 
pcb € Ly (x, [ai,b:]), for 7 =1,2,...,n. 
j 


Of n 
bj ‘i 
Py | 08,08; II ( a ) 
j<k je 
_ (ej = 45)" + (bj = 25)") - (ee = a4)? + (be = ee)") 
4 
o? 
= € Loo (XP y[a:, bil) 9, & € {1,2,...,n}; 
j 
; os 7 1/ 
_ q 
M2.n = »D | 08,08; Il (bi — aa) 


qt+1 
On 

L ” lag, db; 1 
08,08 ; p (Xfi lai, p> ’ 
1 1 
=~ = 1jfor 7, ke {1,2,...9n}; 


(3.14) 


(3.15a) 
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bj — 4; aj +; ) 
+ |g, —- + 
is >. ([a2z Os,08; ||, (AS 9 j 
ae 
Mon = (4: — az aptbe )) 
dc re eee a 
- (41) 
i orf EL (x? [a; bil) , for ke {1 2 nt} 
0sj08; : t=1L09 Vt Js Sie ees 7 
And, 
(3) n 
of 
» lacor I] @- a) 
tat eet diet 
j<k<r phe 
(2m a am) + (bm _ Bie) 
I] 93 ’ 
a (¢2) 
of O° f € Doo (KL, [ai, bi]) 9, kr © {1 nh; 
087,08,08 ; a i=1 (G4, 94]) 9,8, penny : 
‘ : 
a. Ae De. b; — a;)1/4 
2 laaeml, Il ( ai) 
pee i#j,kyr 
M3,n = II as = am) tt _ (Om — 2x ) li | 
m€{j,k,r} q Pe 
Of 
ep dy um i, OF ; ie 
uy 08,08,08; € Lp (x71 [ai, bi]) ,p > 
1 1 ; 
—~+-<=1,for j,k,r © {1,2,...,n}; 
P @ 
s: lac’ 1 
a a ee Leet a 
j<k<r 
(mo + in thn) 
of eS Tee, | 
; a (£2) 
Of 


if 08,08,08 ; 


€ Ly (x, (a, bi]) , for j,k,r € {1,2,...,n}. 
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(3.150) 


(3.16) 
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And, 
(ara) on-lf 
eee (be —= ag) 
2 'N | Oo  OBp~ 4084 lls 
Pr] Im — Om)? + Di — Bin)o 
Mn-1,n = note! ( ) ) (3.17a) 
a 
n—1 
uf Fame ae a E Leo (x4 [a:, bi) sl = 1 
Bie Ost OS, 
tata) 
olf 
ee |S PT 
3S joo x (be — ae) 
iM E = Oy, )tt} ie (Din =: a 
m=l1 q+1 
mA#L ; 
. an-lf ; 
if Tan Oy Ooi € Ly (xfy[ai, bi]) ,p > 1, 
Mn-1,n = —-+-=1,0=1,...,n; (3.17) 


) olf 
O8n ++: O8¢-++ O81 7 


I brn = Gm Am + bm 
5 es Lp, - ——— : 
2 2 


Finally we have 


oO” f ja 
O8n°++ O81 be Qn : 
if anf ela (Relat 
O8n Foi Os 
eee | (eee 
Mann = OSn cits Os, p jai qt 1 5 (3.18) 
. O” f t: 7 
f 55.0 Bay © Ee (xterlei be) p> 1, aarti 
Oo’ f # b; a; aj + b; 
nos 1 IL 9 ie 2 ; 
, Oonf : 
if Fp, ay © Es <teala, bil) 


Proof. Similar to Theorem 3.5. 
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3.3 Application to Probability Theory 


Here we consider the random variables X; > 0, i = 1,2,3, taking values in [0, bj], 
b; > 0. Let F ee the joint probability distribution function of (X1, X2, X3) assumed 
to be in C3(x3 is 10; b;]). Let f be the corresponding probability density function, 


that is f = Ee Clearly the expectation 


bi bo b3 
E(X1X2X3) =i | | 11903 f (x1, 02,03) dri dx2dz3. (3.19) 
0 0 0 
Consider on x?_,[0, };] the function 
G3(a1, £2, U3) =1- F(a, ba, bs) = F(b1, x2, bs) = F (by, be, v3) + F (21,22, bs) 
+ F(a, be, v3) + F'(b1, 2, v3) — F (2x1, 22,23). (3.20) 


We see that 
0°G3(x1, 22, 03) 


Ax 022023 = —f (x1, 22, v3). (3.21) 
In [31], Theorem 6, we established that 
by bo b3 
E(X1X2X3) —_ il | G3(x1, 22, x3)dx1dr2dx3. (3.22) 
0 0 0 


Obviously here we have F'(21, 62,63) = Fx,(%1), F(b1, 22,63) = Fx, (%2), F(b1, 
bz, 43) = Fx,(x%3), where Fx,, Fx,, Fx, are the probability distribution func- 
tions of X1, X2, X3. Furthermore, F'(21,2%2,b3) = Fx,,x,(%1,%2), F(a, b2, 23) = 
Px Xs (21s ea); F(b1, 2, £3) = Fx, ,X3(X2, 23), where Ps Xs Ee owes Py are 
the joint probability distribution functions of (X1, X2), (X1, X3), (Xo, X3), respec- 
tively. That is 
G3(a1,€2, 03) = 1 — Fx, (a1) — Fx, (#2) — Fx, (@3) + Fx,,x2 (#1, £2) 
+ Fx,,x5(@1,%3) + Fx,,x3(€2, 03) — F(a1,22,03). (3.23) 


0? Fx, x 0 Fx, x: 
We set by fl = Fx,1 da = Fxys £8 = Fxg» fiz = “gaydae> £3 = Gaydeg > and 
fog = 2s | the corresponding existing probability density functions. 


022023 
Then we have 


0G3(#1, £2, £3) OF x, xy (£1, £2) 


On, 7 ~ files) % Ox 
Fa OF'x,,x3(1,03) — OF (v1, £2, £3) 
Ox, Ox1 , 
0G3(21, ©, U3) _ OF x, x5 (@1, £2) 
Ox 7 ~ falv2) ss Ox 
OF’x,,X3(@2,%3) OF (%1, £2, 23) 
0x2 0x2 ; 
0G3(41, 2, U3) _ OF x, x3 (@1, 3) 
0x3 = —f3(x3) + 0x3 


OF x, ,X3 (£2, £3) _ OF (x1, 2, £3) 


- 0x3 0x3 


(3.24) 
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Furthermore we find that 
OG3(x1, 22,03) _ OF (x1, €2, 23) 
Ox 0x2 7 fia(#1, #2) 7 Ox 0x2 
0G3(x1, £2, £3) 0? F (x1, 22, 23) 
= Pies edad nie a 3.25 
022023 fas (2, v3) 022023 , ( ) 
and 
0G3(#1, 2, £3) 0? F (x1, 22,23) 
Ox 0x3 0x1 0x3 ; 
Then one can apply (3.6) for a1 = a2 = a3 = 0 and f = G3. 
The left-hand side of (3.6) will be 


= fig(%1, 23) — 


1 by bo bs 
G3(x1, £2, £3) — sue | | ; G3(81, $2, 83)ds8;ds2ds3 


- la — F (x1, b2, b3) — F'(b1, 2, b3) 
_ F'(by, bg, x3) + F(21, £2, 63) + F (x1, be, v3) 
, (3.26) 


+ F(b1, 22,03) — F(x1,22,%3)) — X,X2X3) 


1 
ae 
by babs ( 


by using (3.20) and (3.21). Here (x1, 22,23) € x%_,[0,b;]. Next one can apply 
(3.21), (3.24), (3.25) in the right side of (3.6). 


Chapter 4 


General Probabilistic Inequalities 


In this chapter we use an important integral inequality from [79] to obtain bounds 
for the covariance of two random variables (1) in a general setup and (2) for a 
class of special joint distributions. The same inequality is also used to estimate the 
difference of the expectations of two random variables. Also we present about the 
attainability of a related inequality. This chapter is based on [80]. 


4.1 Introduction 


In [79] was proved the following result: 
Theorem 4.1. Let f € C"(B), n © N, where B = [a1,bi] x +--+ X [Gn, bn], a, 
b; € R, with aj < bj, j = 1,...,n. Denote by OB the boundary of the box B. 
Assume that f(a) =0, for all x = (a1,...,U%n) € OB (in other words we assume that 
f(--+4;5,°°-) = fC ++, 8;,---) =90, for all j =1,...,n). Then 
m(B) OPT, ee iy) 

vy Un)| day +++ d&p_ < —— — 

[lolersmvtn)| ders doy < MED f ATO 


where m(B) = Tj Oy — aj) is the n-th dimensional volume (i.e. the Lebesgue 


dx,:+:dt%p, (4.1) 


measure) of B. 


In the chapter we give probabilistic applications of the above inequality and 
related remarks. 


4.2 Make Applications 


Let X > 0 be a random variable. Given a sequence of positive numbers b,, — ov, 
we introduce the events 


By, ={X > bn}. 
Then (as in the proof of Chebychev’s inequality) 


E[X 1z,] = [ X dP > by P(Bn) = bnP(X > bn), (4.2) 
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where, as usual, P is the corresponding probability measure and F its associated 
expectation. If EF [X] < oo, then monotone convergence implies (since X1 pe / X) 


lim E [X11] = E[X]. 
Thus 
E[X1p,] = E[X]- E[X1p.] > 0. 
Hence by (4.2) 
bn P(X > bp) 0, (4.3) 


for any sequence b, — oo. 
The following proposition is well-known, but we include it here for the sake of 
completeness: 


Proposition 4.2. Let X > 0 be a random variable with (cumulative) distribution 
function F(a). Then 


E[X]= i "tL — Fla)lae (4.4) 


(notice that, if the integral in the right hand side diverges, it should diverge to +00, 
and the equality still makes sense). 


Proof. Integration by parts gives 


a [l — F(x)] dx = 


lim [t= F(a] + f° 2dF(e) = jim 2P (x >a) + BLN], 


L200 


If E[X] < ov, then (4.4) follows by (4.3). If E[X] = oo, then the above equation 
gives that 


[ u-Fe@lde=~, 


since rP (X > x) > 0, whenever x > 0. 


Remark 4.3. If X > a is a random variable with distribution function F(x), we 
can apply Proposition 4.2 to Y = X — a and obtain 


Co 


E(X)=a+ / [1 — F(x)] dx. (4.5) 


a 


We now present some applications of Theorem 4.1. 


Proposition 4.4. Consider two random variables X and Y taking values in the 
interval [a,b]. Let Fx (x) and Fy(x) respectively be their distribution functions, 
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which are assumed absolutely continuous. Their corresponding densities are f x(x) 
and fy(x) respectively. Then 


~—* f lfx(a) - fr(@)lae (4.6) 


Proof. We have that Fx (b) = Fy(b) = 1, thus formula (4.5) gives us 


|E[X]— E[Y]| < 


b b 


EX] =a+ [ [l— Fx(a)|dx and EY|=a+ f [1 — Fy (a)]da. 


a a 


It follows that 
b 
E(X]- EW < f Fx(0) - F()| de. 


Since Fx(x) and Fy(x) are absolutely continuous with densities fx(x) and fy(x) 
respectively, and since F'x (a) — Fy(a) = Fx(b) — Fy(b) = 0, Theorem 4.1, together 
with the above inequality, imply (4.6). 


Theorem 4.5. Let X and Y be two random variables with joint distribution func- 
tion F(x,y). We assume thata < X <bandc<Y<d. We also assume that 
F (x,y) possesses a (joint) probability density function f(x,y). Then 


ey ig d b 
Jeo Xv] s SNE PP flay) = foto)frtwlaedy, (47) 


where 
cou(X,Y) = E[XY]—-E[X]E[Y] 


is the covariance of X and Y, while fx(a) and fy(y) are the marginal densities of 
X and Y respectively. 


Proof. Integration by parts implies 
d pb 
E(XY] =aE[Y]+ckE X]-ac+ [ / (1 — F(a, d) — F(b,y) + F(a, y)| dady (4.8) 


(in fact, this formula generalizes to n random variables). Notice that F(a, d) is the 
(marginal) distribution function of X alone, and similarly F'(b, y) is the (marginal) 
distribution function of Y alone. Thus, by (4.5) 


b d 
B[x]=a+ | (1 — F(x, d)] dx and Bly|=c+ | [1 — F'(b, y)] dy, 
which derives 


d b 
(E[X] -«) (B[Y]-¢) = ‘| / [1 — F(e, a)] [1 - F(b,y)] dedy, 


or 


E[X] E[Y] = aB[Y] + cB [X] —act+ 
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[ fi [1 — F(a, b) — F(a, y) + F(a, d)F (6, y)| dady. (4.9) 
Subtracting (4.9) from (4.8) we get 
cov (X,Y) = E[XY]- n= fe (x,y) — F(a, d)F (a, y)| dxdy. 


Hence 


|cov (X,Y < fo fr |F(x,y) — F(x, d)F(b, y)| drdy. 


Now F(a, y) = F(a,c) =0 and F(b,d) = 1. It follows that F(a, y) — F(x, d)F(b, y) 
vanishes on the boundary of B = (a,b) x (c,d). Thus Theorem 4.1, together with 
the above inequality, imply (4.7). 


In Feller’s classical book [194], par. 5, page 165, we find the following fact: Let 
F(a) and G(y) be distribution functions on R, and put 


U (2, y) = F(2)Gy) {1—a[l— F@)|- Gm, (4.10) 
where —1 < a < 1. Then U(z,y) is a distribution function on R?, with marginal 
distributions F(a) and G(y). Furthermore U(x, y) possesses a density if and only 
if F(x) and G(y) do 
Theorem 4.6. Let X and Y be two random variables with joint distribution func- 
tion U(ax,y), as given by (4.10). For the marginal distributions F(a) and G(y), of 


X and Y respectively, we assume that they posses densities f(x) and g(y) (respec- 
tively). Furthermore, we assume thata < X <b andc<Y <d. Then 


|cov(X,Y)| < lal a (4.11) 
Proof. The (joint) density u(x, y) of U(a, y) is 
u(a,y) = FEY — p(a)gly) {1 ~ aft -2F@)] [1 2G)]}- 
Thus 
u(e,y) — f(2)gly) = —a[1 — 2F(2)] [1 — 2G@)] F(e)9(9) 
and Theorem 4.5 implies 
|lcov(X,Y)| < 
jj Coto) sna f' In — 2F()| Fe we} | f CONG vy). 
or 
lcov (X,¥)| < af PME 9 pi ar(xyBi—2G~)]. 4.12) 


Now, F(x) and G(y) have density functions, hence F'(X) and G'(Y) are uniformly 
distributed on (0,1). Thus 


E(|l—2F(X)|] = Ell —2G(Y)|] = 


1 
2 
and the proof is done by using the above equalities in (4.12). 
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4.3 Remarks on an Inequality 


The basic ingredient in the proof of Theorem 4.1 is the following inequality (also 
shown in [79]): 

Let B = (a1,b1) x --+ X (Gn, bn) C R”, with a; < b;, 7 =1,...,n. Denote by B 
and OB respectively the (topological) closure and the boundary of the open box 
B. Consider the functions f € Co(B)M C"(B), namely the functions that are 
continuous on B, have n continuous derivatives in B, and vanish on OB. Then for 
such functions we have 

Oo” f oa vagy) 


Fen tn) Sap f [ea 


true for all (x1,...,%n) € B. In other words 


dx, ---dzn, 


o” es see DL Zin) 
ee ED ae she, 41 
Ill Sop f [AE ae de (1.13) 
where || - ||, is the supnorm of Co(B). 
Remark 4.7. Suppose we have that aj = —oo for some (or all) j and/or b; 
oo for some (or all) 7. Let B= = BU {oo} be the one-point cottipaetitieation Ot of 
B and assume that f € Co(B)NC"(B). This means that, if a; = —ov, then 


lims,—oo f (#1, -.-; 8j,--,2n) = 0, for all x, € (ax,b,), & A Jj, and also that, if 
bj = oo, then lim 365 f (@15.«,55.+-,2n) = 0, for all xy. € (ag, bg), BA 7. 
Then the proof of (4.13), as given in [79], remains valid. 


Remark 4.8. In the case of an unbounded domain, as described in the previous 
remark, although || f||,, < 00, it is possible that 


; OP f (@15%%5 na) 
B| Ox1-+-OXn 


For example, take B = (0,00) and 
f(z) = | G& = Feitlte-*) dé. 
0) 2 
Before discussing sharpness and attainability for (4.13) we need some notation. For 
c= (C1,..-;Cn) € B, we introduce the intervals 
Tj,0(c) = (aj, C;) and T51(c) = (c;, b;), j = 1, coeg 
Theorem 4.9. For f € Co(B)C"(B), inequality (4.13) is sharp. Equality is 


dx, ---dx%yn = . 


attained if and only if there is ac = (c1,...,¢n) € B such that the n-th mized 
derivative O” f (#1, ...,%n)/OX1 +--+ OX, does not change sign in each of the 2” “sub- 
bowes” Tye, (c) X +++ X In,e, (€)- 


Proof. (<) For an arbitrary c = (c1,...,¢n) € B we have 


fle, agen) 7 (Sissies if an f eA a -din, (4.14) 
lh it ) In,en (C) Ox, - OLn 
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where each ¢€; can be either 0 or 1. If c is as in the statement of the theorem, then 


O” f (241, 5 En 
Ylersendi= fof 8 F(@1; Fn) 
The, (¢) In,en (0) 


0x1 +++ OXn 
Adding up (4.15) for all 2” choices of (€1,...,€n) we find 


dz, -++d&y. (4.15) 


pena ai) 
DH pieeee ———————_| d dxtyn, (4.16 
oe ie dae ae ts Ome Oe,, ne 
or 
-_ O" f (a1, ..., 2 a) 
hei: Gn) | = an Ohana | dz, ++: dzpy. 


This forces (4.13) to become an equality. In fact we also must have 


IF (C1y +n) = II Flo » 
ie. |f(x)| attains its maximum at x =c. 
(=) Conversely, let c € B be a point at which |f(c)| = ||f||,, (such a c exists 


since f is continuous and vanishes on OB and at infinity). If for this c, the sign 
of the mixed derivative 0" f(x1,...,2n)/Ox1-+- Ox, changes inside some “sub-box” 
Tye,(c) x +++ x Ine, (c), then (4.14) implies 


Il = L#lerraen)l < f ae 7 


Thus forces f, (4.13) become a strict inequality. 


eae sey En ) 
0x1 ++: OXn 


dx, ---d&y. 


4.4 Lq,q >1, Related Theory 


We give 


Theorem 4.10. Let f € AC([a,b]) (absolutely continuous): f(a) = f(b) =0, and 
pq>l:steeal. Then 


[ velar < 9" ( [ire iran) " (4.17) 


Proof. Call c= 4%. Then 


and 
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ois f “Te w)ldy, 


For a<a<c we observe, 


and 
i: If(2)|de < / / LF" (w)ldyde 
(by Fubini’s theorem) 
- dx \ d — yf’ wld 
ff ice) y= [ e-wlroley 
c 1/p 
< ( { (c- y)Pdy) IF"lhcocae): 
That is, 
a If(a)|de < ee maT (4.18) 
243 (p+ 1)i/P Oe 


Similarly we obtain 


n)| < i “wildy 
/ “[fa)lde < / if “UP (laude 


(by Fubini’s theorem) 


b y b 
=f f irwlazay = f welt wiidy 
‘ 1/p 
< (/ yor) IIF'Ilz2¢c,b): 


44 


[veonas < estan (4.19) 


and 


That is, we get 


Next we see that 


(/ ite) : = if iy eitoee i: toi q 
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(b— a)'tp . r 
: Gas (W/'lascaey + IF este) 


b—a)!t : 
< (oe) 2° (IMF Ifa ane) + UF Ibaceay) 


= (G20? Ve 
= 243 (p+ 1)i/e IF W50¢a,0): 


That is we proved 


: (b-a)*? 11), 

Vela s ipa ie *\|f"|lLa(a,b); (4.20) 
which gives 

[seer s EP IF hse (4.21) 


proving (4.17). 


Remark 4.11. We notice that lim In(p+1)!/? = 0, that is lim (p+ 1)/? = 1, 
proc poco 
Let f € C’([a,b]), f(a) = f(b) = 0. Then by Theorem 4.1 we obtain 


b b 
b-a 
[ ltolae s "$* [ir wiay, (4.22) 
which is (4.17) when p = oo and gq = 1. When p = q = 2, by Theorem 4.10, we get 
1/2 
b 2 b 
(b = a)?! i / 2 

dx << ——=—_ d ; 4.23 
[isle < Ef royeay (4.23) 


The L,, ¢ > 1, analog of Proposition 4.4 follows 


Proposition 4.12. Consider two random variables X and Y taking values in the 
interval [a,b]. Let Fx(x) and Fy(x) respectively be their distribution functions 
which are assumed absolutely continuous. Call their corresponding densities fx (x), 
fy (x), respectively. Let p,q>1: “ + - =1. Then 


(b— a) )its é a 
B(X) — BO)| < ( [ex — reo vial) (424) 
Proof. Notice that 
b 
|E(X) — B(Y)| < ) |Fx(2) — Fy(2)|dz. (4.25) 


Then use Theorem 4.10 for f = Fx — Fy, which fulfills the assumptions there. 


Observe that (4.6) is the case of (4.24) when p = oo and q=1. 


Chapter 5 


About Grothendieck Inequalities 


We give here the analog of Grothendieck inequality for positive linear forms. We 
find upper and lower bounds of L, 0 < p < ov, type, which all lead to sharp 
inequalities.This treatments is based on [68). 


5.1 Introduction 


We mention the famous Grothendieck inequality [204]. 


Theorem 5.1. (Grothendieck 1956) Let Ky, and Ky be compact spaces. Let wu : 
C (ki) x C(K2) — R be a bounded bilinear form. Then there exist probability 
measures pl, and fg on Ky and Ko, respectively, such that 

1/2 


lu (F,9)| < KB |e ( a fans) ( a “dua , (5.1) 


for all f € C(K) and g € C(K2), where KE is a universal constant. 
We have that 
7 


Tv 
1.57...=—< KR < —____ = 1.782.... 5.2 
2 OF Sita) ve) 


See [204], [240], for the left hand side and right hand side bounds, respectively. 

Still kK, ce precise value is unknown. 

The complex case is studied separately, see [209]. Here we present analogs of 
Theorem 5.1 for the case of a positive linear form, which is of course a bounded 
linear one. 

We use 


Theorem 5.2. (W. Adamski (1991), [3]) Let X be a pseudocompact space and Y 
be an arbitrary topological space, then, for any positive bilinear form ® : C (X) x 
C(Y) — R, there exists a uniquely determined measure ts on the product of the 
Batre o— algebras of X and Y such that 


B(f.g) = I fe adn (5.3) 
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holds for all f € C(X) and allgeEC(Y). 


Note. Above the tensor product (f ® g) (x,y) = f (x) -g(y), X pseudocompact 
means all continuous functions f : X — R are bounded. Positivity of ® means that 
for f > 0, g > 0 we have ®(f,g) > 0. 

From (5.3) we see that 0 < u(X x Y) = @(1,1) << w. 

If ® (1,1) = 0, then ® = 0, the trivial case. 

Also |®(f,9)| < ®(1, 1) ||f ll llgll.. with |/®|] = @(1,1), in the case that X, 
Y are both pseudocompact, so that ® is a bounded bilinear form. 


5.2 Main Results 


We give 


Theorem 5.3. Let X, Y be pseudocompact spaces, ®: C(X) x C(Y) > R be a 
positive bilinear form and p,q>1: aig =1. Then there exist uniquely determined 
probability measures [41, 120n the Baire o—algebras of X and Y, respectively, such 


that 
eral <i ( f (edn) ([ jane) (5.4) 


for all f € C(X)and allg E C(Y). 
Inequality (5.4) is sharp, namely it is attained, and the constant 1 is the best 
possible. 


Proof. By Theorem 5.2 we have 


B(f.g) = I F(a) alu) dele), (5.5) 


and 4 is a unique measure on the product of the Baire o— algebras of X and Y. 
Without loss of generality we may suppose that 6 4 0. Denote by m := u(X x Y), 
so it is 0 < m < oo, and consider the measures ju* (A) := «(Ax Y), for any A 
in the Baire o—algebra of X, and yw** (B) := w(X x B), for any B in the Baire 
o—algebra of Y. Here yu*, ** are uniquely defined. Notice that 


wt (X) = (Xx ¥) =p (¥) =m. (5.6) 
Denote the probability measures p41 := x [2 = ae 
Also it holds 
f ve@ra=f era, (5.7) 
XxY x 


and 
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| lo (v) [2 du = i: la (yl? du. (5.8) 
xXxxY Y 


So from (5.5) we get 


IO(f,9)IS fs If ()| 19 (y)| du (@, y) (5.9) 


(by Holder’s inequality) 


< (fo isa au(e.y)) i (f lotenlt(x,u)) 65.10) 
=(fireorae @) (fio ent @) (5.11) 
=m(f 1s? din @) (floor @) 


proving (5.4). 
Next we derive the sharpness of (5.4). 
Let us assume f (2) = ci > 0, g(y) = cg > 0. Then the left hand side of (5.4) 
equals cicgm, equal to the right hand side of (5.4). That is proving attainability of 
(5.4) and that 1 is the best constant. 


We give 
Corollary 5.4. All as in Theorem 5.3 with p= q= 2. Then 


eral <iiell( [. (rte)? dm) ([ (atu)? au) ae (5.12) 


for all f € C(X) andallg Ee C(y). 

Inequality (5.12) is sharp and the best constant is 1. 

So when ® is positive we improve Theorem 5.1. 
Corollary 5.5. All as in Theorem 5.3. Then 

1/p 1/q 
erolsiel, int (firc@ran) (f wld). 643) 
{p.q>u:tttai} \x Y 

for all f € C(X) and all g € C(Y). Inequality (5.13) is sharp and the best constant 


is 1. 


Corollary 5.6. All as in Theorem 5.8, but p=1, q = co. Then 
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1B (f.9)1 < III Male ( iL I («)| dua) | (5.14) 


for all f EC (X)and all ge C(Y). 
Inequality (5.14)is attained when f (x) > 0 and g(y) =c> 0, so it is sharp. 


Proof. We observe that 


8 (F.9)1 < laos hes It (0) | du (2.9) 
= lll ( [r@lae (=) (6.15) 


= mlolec ( [lan (2) (5.16) 


= lll ( fF (oldu) 


Sharpness of (5.14) is obvious. 


Corollary 5.7. Let X, Y be pseudocompact spaces, ®: C(X) x C(Y) > R be a 
positive bilinear form. Then there exist uniquely determined probability measures 
L41, 2 on the Baire o— algebras of X and Y, respectively, such that 


1 (F, 9)| < [ll] min (Ul. ( [ I (2)1 dun) floc ( i ja(v)l 2) ) , (6.17) 
for all f € C(X) and all gE C(Y). 


Proof. Clear from Corollary 5.6. 


Next we give some converse results. 
We need 
Definition 5.8. Let X, Y be pseudocompact spaces. We define 
Ct (X):={f : X > R,U{0} continuous and bounded} , (5.18) 
and 
Ct* (Y) :={g:Y — R, continuous and bounded} , 


where Ry := {x ER: x >O0}. 
Clearly C* (X) C C(X), and Ctt (Y) CC(Y). 
So C** (Y) is the positive cone of C(Y). 


Theorem 5.9. Let X, Y be pseudocompact spaces, ® : Ct (X) x Ctt(Y) —R 
i = 
P 


be a positive bilinear form andO <p<l,q<0:=+ ; 1. Then there exist 
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uniquely determined probability measures [41, 2 on the Baire o— algebras of X and 
Y, respectively, such that 


w(F9) > Well (f. (F(a) dms) Gs (o(u)tdna) (5.19) 


for all f € C* (X) and allg € CTT (Y). 
Inequality (5.19) is sharp, namely it is attained, and the constant 1 is the best 
possible. 


Proof. We use the same notations as in Theorem 5.3. By Theorem 5.2 and reverse 
Holder’s inequality we obtain 


vna= ff _ F(a) 9(0) du (au) 


(FP dulew) (fan) 6.20) 
(= Ce ) 
= (fcr eey a @) ([ owt aves @) 


=m(f (rey? din @) (f oy" tn w) (5.21) 


proving (5.19). 
For the sharpness of (5.19), take f(x) = ci > 0, g(y) = co > 0. Then L.A.S 
(5.19) = R.H.S (5.19) = meice. 
We need 


Definition 5.10. Let X, Y be pseudocompact spaces. We define 
Cy (X):= {f : X > R_U{0} continuous and bounded} , (5.22) 
and 
C~~ (Y) :={g:Y — R_ continuous and bounded}, 


where RL :={xER: x <0}. 

Clearly C~ (X) C C(X), and C~~ (Y) CC(Y). 

So C—~ (Y) is the negative cone of C'(Y). 
Theorem 5.11. Let X, Y be pseudocompact spaces, ® : C7 (X) x C7~(Y) —R 
be a positive bilinear form andO <p<1l1,q< 0: aor ; = 1. Then there exist 


uniquely determined probability measures [41, [42 on the Baire o— algebras of X and 
Y, respectively, such that 


w(f9)> lel (f seam) @ jl") (5.23) 


44 PROBABILISTIC INEQUALITIES 


for all f € C7 (X) and allg € C7 (Y). 
Inequality (5.23) is sharp, namely it is attained, and the constant 1 is the best 
possible. 


Proof. Acting as in Theorem 5.9, we have that ®(f,g) > 0 and 


5(f,9) = a _ F(a) 9(0) du (eu) = I _F@lla@lde(ey) 6.24) 


> (fier ay ww) Cf. lotenitae ww) (5.25) 
= (fire wm) (f[ iotenlt @) 


zn([ip@raat)) ({ w@ltanw) ¢ 1628) 
xX Y 


establishing (5.23) . 
For the sharpness of (5.23), take f (x) = ci: < 0, g(y) = c2 < 0. Then L.H.S 
(5.23) = mceicg = R.H.S (5.23). 
Hence optimality in (5.23) is established. 


We finish this chapter with 
Corollary 5.12. (to Theorem 5.9) We have 


®(f,9) = ||| sup 
{p,4: 0<p<1,q<0,4+4=1} 


(raya) (f (ytd) (5.27) 
xX Y 


for all f € C* (X) and allg Ee Ctr (Y). 

Inequality (5.27)is sharp, namely it is attained, and the constant 1 is the best 
possible. 

And analogously we find 


Corollary 5.13. (to Theorem 5.11) We have 


® (f,9) = ||®| sup 
{p,4: 0<p<1,q<0,4+4=1} 


wean) (fie@itaa\ | (5.28) 
xX Y 


for all f © C7 (X) and allg EC (Y). 
Inequality (5.28) is sharp, namely it is attained, and the constant 1 is the best 
possible. 


Chapter 6 


Basic Optimal Estimation of Csiszar’s 
f-Divergence 


In this chapter are established basic sharp probabilistic inequalities that give best 
approximation for the Csiszar’s f-divergence, which is the most essential and general 
measure for the discrimination between two probability measures.This treatment 
relies on [37]. 


6.1 Background 


Throughout this chapter we use the following. Let f be a convex function from 
(0, +00) into R which is strictly convex at 1 with f(1) = 0. 

Let (X,A, 2) be a measure space, where . is a finite or a o-finite measure on 
(X, A). And let ji, “2 be two probability measures on (X, A) such that 1 <4, 
[ig < A (absolutely continuous), e.g. A = ti + pa. 

Denote by p = tn q= She the (densities) Radon—Nikodym derivatives of j11, 
Lg with respect to A. Here we suppose that 


hegeh eh a.e. on X anda<1<b. 
qd 


The quantity 


yuna) = f acai (A) ance (6.1) 


was introduced by I. Csiszar in 1967, see [140], and is called f-divergence of the 
probability measures 4; and pg. 

By Lemma 1.1 of [140], the integral (6.1) is well-defined and Tp(j11, 2) > 0 
with equality only when 41 = fg, see also our Theorem 6.8, etc, at the end of 
the chapter. Furthermore I'f(u1, 42) does not depend on the choice of A. The 
concept of f-divergence was introduced first in [139] as a generalization of Kullback’s 
“information for discrimination” or I-divergence (generalized entropy) [244], [243] 
and of Rényi’s “information gain” (I-divergence) of order a [306]. In fact the 
I-divergence of order 1 equals 


Du logs u( fA, [i2). 
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The choice f(u) = (u— 1)? produces again a known measure of difference of distri- 
butions that is called .?-divergence. Of course the total variation distance 


a= I tp() — q(e)| ad(2) 


is equal to P),,_1)(f41, #2). Here by supposing f(1) = 0 we can consider I (p11, 12) 
the f-divergence as a measure of the difference between the probability measures 
M1, Ha. 

The f-divergence is in general asymmetric in fz; and jig. But since f is convex 
and strictly convex at 1 so is 


f*(u) = uf (+) (6.2) 
and as in [140] we obtain 


Dy (Ha, Hi) =D y* (11, Ha). (6.3) 


In Information Theory and Statistics many other divergences are used which 
are special cases of the above general Csiszar f-divergence, for example Hellinger 
distance Dy, a-divergence Da, Bhattacharyya distance Dg, Harmonic distance 
Dua, Jeftrey’s distance Dz, triangular discrimination Da, for all these see e.g. 
[102], [152]. 

The problem of finding and estimating the proper distance (or difference or 
discrimination) of two probability distributions is one of the major ones in Prob- 
ability Theory. The above f-divergence measures in their various forms have been 
also applied to Anthropology, Genetics, Finance, Economics, Political science, Bi- 
ology, Approximation of Probability distributions, Signal Processing and Pattern 
Recognition. 

A great motivation for this chapter has been the very important monograph on 
the matter by S. Dragomir [152]. 


6.2. Main Results 


We give the following. 
Theorem 6.1. Let f € C1({a,b]). Then 


Dy(041, #2) < [If'lloo,ta. 7 Ip(a) — a(z)| aA(c). (6.4) 


Inequality (6.4) is sharp. The optimal function is f*(y) = |y —1|*%, a > 1 under 
the condition, either 

i) max(b—1,1—a) <1 and C:=esssup(q) < +o, (6.5) 
or 


ii) lp—q|<q<1, ae. on X. (6.6) 
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Proof. We see that 


V5 (441, 2) = [aes (A) dy 
=f aes (2) dy — f(1) 
= f aay (HS) ar— f aafayar 
= fe) (1(22) - 10) a 
< fata) (4 ~ (1a 


SW Fiean I (2) 


=F ste : bp() — g(2)| da, 


proving (6.4). Clearly here f* is convex, f*(1) = 0 and strictly convex at 1. 


Also we have 


f(y) = aly — 1/9" sign(y — 1) 


and 
Nt lees fee = a(max(b -—i1,1- a))*. 
We have 
LHS.(6.4) = f (a(a))-*[p(x) — ale)|° a, 
xX 
and 
R.H.S.(6.4) = a(max(b—1,1—a))*" rs |p(a) — q(a)| dd. 

Hence 


lim R.HS.(6.4) = I Ln) — o(x)| aa. 
Based on Lemma 6.3 (6.22) following we find 
lim L.H.S.(6.4) = lim R.H.S.(6.4), 
establishing that inequality (6.4) is sharp. 


Next we give the more general 


Theorem 6.2. Let f € C"+"({a,b]), n EN, such that f(1) = 0, k =0,2,... 


Then 


eagete | Perc 


T (1, 2) < n+)! 


i (a(2))-" Ip) — q(2)/"*1 dA(2). 
XxX 
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(6.8) 


(6.9) 


(6.10) 


sn. 


(6.11) 
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Inequality (6.11) is sharp. The optimal function is f(y) := \y—1|"t%, a > 1 under 
the condition (i) or (tt) of Theorem 6.1. 


Proof. Let any y € [a,b], then 


mn flr) 
Fly) ~ £0) = Yo PW VE + Rally (6.12) 
where 
Ratu) = fF = £0) Eat (6.13) 


here y>lory<l. 
As in [27], p. 500 we get 


2 gsm eare 


PRa(la)| < ely — apt, (6.14) 
for all y € [a,b]. By theorem’s assumption we have 
fyM=fMOWU-)+Rall.y), ally € [a, 0). (6.15) 
Consequently we obtain 
2) ia = f otore. (1,222) ave 
[or (BS) aww =f der (. 2S) ae. 6.16) 


Therefore 


Py(aisse) < fae) [Ro(1. 42) | ancy 


q(x) 
(by 6.14) [FOI] fa.y pa) _4|"* 
on ee i a(x) | 1] a(n) 
- Traces | Bee 


CES [ (ace eta) - aay ance), 


proving (6.11). Clearly here f is convex, f(1) = 0 and strictly convex at 1, also 
fecr**({a, 0). 
Furthermore 


and 
n+1 


FOV (y) = (n+.a)(n +a —1)-+- (a+ Naly — 1/97! (sign(y — 1) 
Thus 


FOP Y)| = (id a -»] ade, 


j=0 


Basic Optimal Estimation of Csiszar’s f-Divergence 49 


and 
ire ries= (Te +a- | (max(b—1,1—a))°'. (6.17) 
j=0 
Applying f into (6.11) we derive 
7 ? p(x) a nt+a 
L.H.S.(6.11) = I q(x) @ 1 dd 
= [ace @ — q(x)|"** d(x), (6.18) 
and 
MG +a —j)(max(b—1,1-a))°-! 
R.HS.(6.11) = aa 
[ere — q(x)|"** dX(2). (6.19) 
Then 
jim, R.H.S.(6.11) = | (q(x))~" |p(x) — q(x)|"** dX(z). (6.20) 
x 


By applying Lemma 6.3 here, see next, we find 


lim L.HS.(11) = lim [ (q(x))'~"~*|p(a) — a(x)|"** aA(a) 


al x 


= i (a(2))-"|p(2) — 4(2)|"*" ad(2), 
x 


proving the sharpness of (6.11). 


Lemma 6.3. Here a > 1, n € Z4 and condition i) or ii) of Theorem 6.1 is valid. 
Then 


lim f (g(x))'~"~*|p(x) — q(2)|"** dA(z) 


al x 
= f (ale) (a) - 2)!" aX(o). (6.21) 
When n = 0 we have 
Bim (ala))'-* (x) — a(a)|* Aa) = ff w(x) - a(a)|AX(w). (6.22) 
xX 


Proof. i) Let the condition (i) of Theorem 6.1 hold. We have 
p(x) 


a-1<——~-1<b-1, ae. on X. 


q(x) 
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Then 
cS 1] < max ~1 l—a)<1, ae onX 
q(x) 
Hence 
n+a n+1 
Foe < | < | i] <1. a.e. on X, 
q(2) q(x) q(x 
and 
n+a n+1 
lim p(x) = pa) _ » ae. on X 
acl q(x) q(x) 


By Dominated convergence theorem we obtain 


Then notice that 


peo, SN ee). = ieee 
0< / az) |——-1 a i dy 
x q(x) q(x) 
n+1 nt+a 
cof Fone ee 1 d\—0, asa 1 
x) q(x) 
Consequently, 
lim | q(x) P(z) -1 inte dX= | q(a) Ie) 1 Pe dx 
aml J x q(x) x q(x) 
implying (6.21), etc. 
ii) By g< 1 ae. on X we get qrt* < q’*?"1 ae. on X. 


lpg] he Ge? ae on XS 
Here by assumptions obviously g > 0, q'~"~* > 0 ae. on X. Set 
o, = {2 € X: |p(z) — g(a) |"** > (q(x))"F°"F, 
then A(o,) = 0, 
oy = {a € X: q(x) = 0}, 
then A(o,) = 0, 
03 = {a € X: (q(x))'~"* |p(x) — g(x) /"F* > 1}. 
Let xo € 03, then 
(g(x0))'"~“|p(xo) — a(ao)/"F* > 1. 
If q(xo) > 0, i-e. (q(xo))"*°~* > 0 and (q(xo))'~"~* < +00 then 
|p(xo) — a(xo)|"** > (q(ao))"*°™". 


Hence zo € 0. 
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If g(vo) = 0, then x € a5. Clearly o, C 0, Ua and since A(o, Ug.) = 0 we 
get that (a3) = 0. In the case of g(ao) = 0 we get from above oo(p(ao))”** > 1, 
which is only possible if p(ao) > 0. That is, we established that 


(q(x))*""*|p(z) — g(a)|"** <1, ae. on X. 
Furthermore 
lim (q(2x))!-"- [p(x) — a(@)|"** = (a(2))"" pla) — ala)", ae. on X. 
Therefore by Dominated convergence theorem we derive 


i, (q(2))'~"~" |p(x) — q(a)|"*% dd > | (q(x))~"|p(a) — q(x)|"*1 dd, 
ie xX 


as a —> 1, proving (6.21), etc. 


Note 6.4. See that |p — q| <q a.e. on X is equivalent to p < 2q a.e. on X. 
Corollary 6.5. (to Theorem 6.2) Let f € C?({a,b]), such that f(1) =0. Then 


(2) 
Py(yrspa) sel « F (g(a) (2) ~ ate)? AG). (6.23) 
xX 


Inequality (6.23) is sharp. The optimal function is f(y) := |y—1|'t*, a > 1 under 
the condition (i) or (i) of Theorem 6.1. 


Note 6.6. Inequalities (6.4) and (6.11) have some similarity to corresponding ones 
from those of N.S. Barnett et al. in Theorem 1, p. 3 of [102], though there the 
setting is slightly different and the important matter of sharpness is not discussed 
at all. 


Next we connect Csiszar’s f-divergence to the usual first modulus of continuity 
Wy. 


Theorem 6.7. Suppose that 
0<hi= | |p(a) — g(x)| dA(x) < min(1 — a,b—1). (6.24) 
x 
Then 


T(t, 42) S wi(f,h). (6.25) 
Inequality (6.25) is sharp, namely it is attained by f*(x) = |x — 1]. 
Proof. By Lemma 8.1.1, p. 243 of [20] we obtain 


f(a) s BG 4, (6.26) 
for any 0 < 6 < min(1 —a,b—1). Therefore 
wi(f, 6) p(2) 
T5(t1, M2) < tee fae) ae = 1] arco 


= EL Iolo) al ne. 
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By choosing 6 = h we establish (6.25). Notice wi(|~ — 1|,6) = 46, for any 6 > 0. 
Consequently 


Pe-asasse) = fh ate) pte) 


Ht} 1) ax(a) = [ ele) - aay} anco 


ax (1 =, f tele) - ate) an(a)) | 


proving attainability of (6.25). 


Finally we reproduce the essence of Lemma 1.1. of [140] in a much simpler way. 


Theorem 6.8. Let f : (0+ 00) — R convex which is strictly conver at 1,0<a< 
1<b, ax<t<b,ae on X. Then 


Ty (1, #2) = F(1). (6.27) 


Proof Set b:= 2 € R, because by f convexity both f/, exist. Since f is 
convex we get in general that 


f(u) > f0)+0(u-1), Vu € (0,400). 


We prove this last fact: Let 1<u,< ue, then 


f(ui) — fQ) < f(uz) — fQ) 
Uy — 1 7; U2 — 1 , 
hence 
al est A) 1) for u>1 
That is 


Let now wu; < ug <1, then 


f(u1) = FQ) 
u;—1 u2—1 


hence 


Also by convexity f! (1) < fi. (1), and 


That is, we have 
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Let u>1then f(u) > f(1) + f,(1)(u—1) > f(1) + b(u— 1), that is 
f(u) 2 fF) + bu-1 


) for u>l. 
Let u<1 then f(u) > f(1)+ fi ()(u— 1) > f(1) + b(u — J), that is 


Ww 


f(u) > fA) +0(u—1) for u< 1. 

Consequently it holds 
f(u) > f()+o(n—1), Vu € (0,+00), 
proving this important feature of f. Therefore iim Stuy > b. 


We do have 
(2) > fa) +6(2-1), a.e.on X, 
qd qd 


and 


af (2) > fQq+b(p—gq), ae.onXx. 


it af (4) dd > f(1), 


Tp(t1, 2) > f(1). 


The last gives 


which is 


Note 6.9 (to Theorem 6.8). Since f is strictly convex at 1 we get 
with equality at u=1. 


(2) >sayso(4-1), pa 


with equality if p= q;a.e. on X. 


Consequently 


Thus 
qf (2) > f(1)¢+b(p—¢), whenever p¥# g;a.e.0nX, 
q 

and 

Py(u1, M2) > fC), if pa F pe. (6.28) 
Clearly 

Py(Hi,H2) = f(1), if wi A pe. (6.29) 
Note 6.10 (to Theorem 6.8). If f(1)=0, then I p(pu1, ue) > 0, for pr A uo, and 


Tp (p41, f2) = 0, if wa = pr. 
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Chapter 7 


Approximations via Representations of 
Csiszar’s f-Divergence 


The Csiszar’s discrimination is the most essential and general measure for the com- 
parison between two probability measures. Here we provide probabilistic representa- 
tion formulae for it on various very general settings. Then we present tight estimates 
for their remainders involving a variety of norms of the engaged functions. Also are 
given some direct general approximations for the Csiszar’s f-divergence leading to 
very close probabilistic inequalities. We give also many important applications. 
This treatment relies a lot on [43]. 


7.1 Background 


Throughout this chapter we use the following. 

Let f be a convex function from (0,-+00) into R which is strictly convex at 1 
with f(1) = 0. Let (X,A,A) be a measure space, where is a finite or a o-finite 
measure on (X,.A). And let p41, 2 be two probability measures on (X,.A) such that 
bi <A, 2 < » (absolutely continuous), e.g. A = pi + pa. 

Denote by p = te q= tee the (densities) Radon-Nikodym derivatives of 11, 
Lg with respect to A. Here we suppose that 


Coe Sh ae. on X and a<1<b. 
qd 


The quantity 
p(2) 

Pytuasna) = f aad (BS) arc (7.1) 
was introduced by I. Csiszar in 1967, see [140], and is called f-divergence of the prob- 
ability measures 4; and pa. By Lemma 1.1 of [140], the integral (7.1) is well-defined 
and T'y(441, 2) > 0 with equality only when 4; = jg. Furthermore I'p(/11, 2) 
does not depend on the choice of A. The concept of f-divergence was introduced 
first in [139] as a generalization of Kullback’s “information for discrimination” or 
I-divergence (generalized entropy) [244], [243] and of Rényi’s “information gain” 
(I-divergence of order a) [306]. In fact the I-divergence of order 1 equals 


Du logs ul, [l2). 
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The choice f(u) = (uw — 1)? produces again a known measure of difference of 
distributions that is called y?-divergence. Of course the total variation distance 
|H1 — Hal = fx p(x) — 9(x)| dA(z) is equal to T\-1)(H1, M2). 

Here by supposing f(1) = 0 we can consider I y(j11, p2), the f-divergence as a 
measure of the difference between the probability measures j41, 2. The f-divergence 
is in general asymmetric in 41 and pe. But since f is convex and strictly convex at 
1 so is 


fu) = uf (+) (7.2) 
and as in [140] we obtain 


Dy (Ha, Hi) =D y~ (141, Ha). (7.3) 


In Information Theory and Statistics many other divergences are used which are 
special cases of the above general Csiszar f-divergence, e.g. Hellinger distance Dy, 
a-divergence Dy, Bhattacharyya distance Dg, Harmonic distance Dyq, Jeffrey’s 
distance D,, triangular discrimination Da, for all these see, e.g. [102], [152]. The 
problem of finding and estimating the proper distance (or difference or discrim- 
ination) of two probability distributions is one of the major ones in Probability 
Theory. 

The above f-divergence measures in their various forms have been also applied 
to Anthropology, Genetics, Finance, Economics, Political Science, Biology, Approx- 
imation of Probability distributions, Signal Processing and Pattern Recognition. A 
great motivation for this chapter has been the very important monograph on the 
topic by S. Dragomir [152]. 


7.2 All Results 


We present the following 
Theorem 7.1. Let f,g € C'({a,b]) where f as in this chapter, g' #0 over [a,b]. 


Then 
oe, [a,b] h ee f (a5) 7 a) ae me 


Proof. From [112], p. 197, Theorem 27.8, Cauchy’s Mean Value Theorem, we have 
that 


/ 
Tr(a, ba) < IZ 
g 


f(g) — 9(2)) = g' (OF) — £(@), (7.5) 


\g(o) — g(a)I, (7.6) 
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and 
HOMO a ORO! 
F Noe,[a,0] 
for any y, z € [a,b]. Clearly it holds 
p(x) _ ce ) lg (2@) _ 
ace)|s (2B) — sen] s [FI ae |e (BS) - 9 


a.e. on X. The last gives (7.4). 


Examples 7.2 (to Theorem 7.1). 
1) Let g(x) = 4. Then 


q(x 
Py(yisd2) <le?F' loos | Ue) Gh deans 
xX p(x) 
2) Let g(x) = e”. Then 
Ps (itis 2) < lle? F'lleoe.a i (a) 
3) Let g(x) = e~*. Then 
_ pie) ae 
Plus a) < MeFllsoson fale) few 9 — e-| axa, 


4) Let g(x) = Ina. Then 


T(t, #2) $ ll2eF'lleo,to. Hh (a) 


5) Let g(x) = ena, with a>e7!. Then 


i [ 
6) Let g(x) = x. Then 
Dy(4115 2) < 2I| VEF"lloo fat I Jae) Vota) — Vala)| aX(2). 


Then 


/ 


if Ff | rr 
f(a, Ma) < Ife 


7) Let g(x) = 2%, a>1. 
i 


goed 


1 
ii gies 
f (Hi, M2) < 7 


Next we give 


Theorem 7.3. Let f € C'({a,b]). Then 


b 
Py(yns 42) — 5 [ fw 


ata 


i g(a)!" |(p(a))% = (q(@))*| dA(a). 
[a,b] YX 
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(7.9) 


(7.10) 


(7.11) 


(7.12) 


(7.13) 


(7.14) 


(7.15) 


< ies (=) —(arn+ [ ovrane), (7.16) 
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Proof. Let z € [a,b], then by Theorem 22.1, p. 498, [27] we have 


1 (oe 


b 
— | tow-se) 


which is the basic Ostrowski inequality [277]. That is 


1 
b-a 


b 
—qA (2) <af (4) - cf f(y)dy < qA (4) 
a.e. on X. 


Integrating (7.19) against \ we find 


—A(z) < f(z) - 


b 
4 fly)dy < A(z), 


and 


-[4(B)estiouni-p from [ (2) 


That is, 


«fa 
gece Cua) 


= II f’Il-0,[a,0) -1 2 2 
= pe fa ((p — aq)” + (bg — p)*) dA, 


b 
T¢(p1, 2) — —/ f(y)dy 


proving (7.16). 
Note 7.4. In general we have, if |A(z)| < B(z),then 


[sear < [ Ban. 


2(b — a) ) If’ Iloo,fa,b] =: A(z), (7.17) 


(7.18) 


(7.19) 


(7.20) 


(7.21) 


(7.22) 


Remark 7.5. Let f as in this chapter and f € C1({a, bj). Then we have Montoge- 


mery’s identity, see p. 69 of [169], 


b b 
f= f sods — [Pens oat 


for all z € [a,b], and 


Hence 


(7.23) 


(7.24) 
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We get the following representation formula 


b 
DP (yas pa) = — (/ Hat +R) ; (7.25) 


Ry:= iE q(x) ( | " ps (42.1) rot) dX). (7.26) 


Remark 7.6. Let f be as in this chapter and f € C®){a,b]. Then according to 
(2.2), p. 71 of [169], or (4.104), p. 192 of [126] we have 


f(z) = a, f soe (oe) (:- (>) 


b pb 
+ wa | / P*(z,t)P*(t, s) f"(s)dsdt, (7.27) 
(b ra a)? a Ja 
for all z € [a,b], where P* is defined by (7.24). So that 


af (2) = tf sous ae r (2 e (*)) 


: ah [ P* (E.:) P*(t, s)f"(s)dsdt, (7.28) 


a.e. on X. By integration against \ we obtain the representation formula 


Pan = Af soa ae (1 - (*)) ae (7.29) 


where 
Ras [a (/ [ P* (2.1) Pin (ia) dd. (7.30) 


Again from Theorem 2.1, p. 70 of [169], or Theorem 4.17, p. 191 of [126] we see 
that 


where 


for all z € [a,b]. 
Therefore 


a (#)--4 Hoa (S-") (p- a) 


2 
1 Bi( BEB) of 
< 3 qd ee a (b—a)*||f""|o0,[a,b); (7.32) 
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a.e. on X. Using (7.22) we have established that 


b — a a 
Pyne) — pee styar— (=X) (0 st) 
2 BR _ (atb)\? ° 
< OO LF lc. ta fa el a+ . (7.33) 
From p. 183 of [32] we have 


Lemma 7.7. Let f: [a,b] — R be 3-times differentiable on [a,b]. Assume that f’” 
is integrable on [a,b]. Let z € [a,b]. Define 


S—a 


; @leer 
b—a 
P(r, s) = , (7.34) 
a r<s<b; r,s € [a,d. 


Then 


. A [sone a ao i P(2, 81) 
+ (ALO) Pf rte ayPtorsa)dords 


b pb pb 
+f / / P(z, 81)P(s1, 82) P(s2, 83) f'" (s3)ds1ds2ds3. (7.35) 


Remark 7.8. Let f as in this chapter and f ¢ C®)({a,b]). Then from (7.35) we 
obtain 


W(8)= stat EL (en) 
ee LO" a ff (2, a1) P\ (81, 82)ds1ds9 
eee (2. a1) P( (81; 89) P(83,'5a) F" (63) dsydeotlea, (7.36) 


a.e. on X. We have derived the representation formula 


Py (1, 2) = —_ i f(si)dsi + (4) [a ie (Z.s:) in) dd 
a (Se et : 2) f (/ [> (2 a1) P\ fovea) d\+R3, (7.37) 


where 


rom fal ff fr (En) Perera sys@oindtiads) ds 


(7.38) 
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We use also 


Theorem 7.9. ([32]). Let f: [a,b] — R be 3-times differentiable on [a,b]. Assume 
that f’” is measurable and bounded on [a,b]. Let z € [a,b]. Then 


re = ag [sone - ae (:- (>) 


fO=f£@Y [3 (a? + b? + 4ab) 
3 OSES) | eae oe ee) l 
Fe lss tert 4 
Se - A*(z), (7.39) 


1 1 1 1 
A*(z) = abe - ze bz + zu be —ab’z? — zo bz + zu + a7b?2" 


1 1 1 3 3 1 
a*bz3 502 502 | ge Fee tye tee 
26g 3 ogg dogg aes, Soa oleae 

-= -= — — —b 

30% gee tygee tbe + gba 


2 2 1 1 aS be 
bab — Sab? — <a5z — =08z + + =|, A 
i. ee 6 Ge OO ee) 
Inequality (7.39) is attained by 

fl = (2-4) + b= 9) (7.41) 


in that case both sides of inequality equal zero. 


Remark 7.10. Let f be as in this chapter and f € C)([a,]). Then from (7.39) 


we find 
0 (S) staf noven (FEM) (-«(") 


7 (fo- Fe) (E ee ee eee ne ‘wi | 
=r j 
< Uap (2), vs 


a.e. on X, A* is as in (7.40). 
We have proved that (via (7.22)) that 


T(t, 2) — ef teovaes = (o) (1 7 (-=*)) 


IF Iho, fa,0) ; (4) 
5 ae i aA (2) aa, (7.43) 
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where A* is as in (7.40). 

We need a generalization of Montgomery identity. 
Proposition 7.11. ([32]). Let f: [a,b] — R be differentiable on [a,b], f’: [a,b] ~ R 
is integrable on [a,b]. Let g: [a,b] — R be of bounded variation and such that 
g(a) £ g(b). Let z € [a,b]. We define 


g(d)— gay’ “6S 

P(g(z), 9(t)) = Bais (7.44) 
Gl) —g 
Wana 


Then it holds 
b 
1a) =" ay aia) an f £0 (t)dg(t) + | P(g )f'(t)dt. (7.45) 


Remark 7.12. Let ie € en g € C({a, b]) and of bounded variation, g(a) 4 
g(b). The rest are as in this chapter. Then by (7.44) and (7.45) we find 


w(*)-at | " pWdolt) +4 ‘i ‘p (o(4) 0) soa, (746) 


a.e. on X. 
Finally we obtain the representation formula 
1 b 
T (pr, = | ftidg(t) + Re 7.47) 
rl 1 2) g(b) — g(a) i (t) (t) ( 
where 


Rigi [a ([° (s (2) a(t) ro) dy. (7.48) 


Based on the last we have 


Theorem 7.13. In this chapter’s setting, f € C1({a,b]) and g € C({a, b]) of bounded 
variation, g(a) # g(b). Then 


b 
Tp (p41, b2) — OErOnE f(t)dg(t) 


< Uthat fa f P(9(2) 9 ‘)|ee) a a} (7.49) 


Examples 7.14. (to Theorem 7.13). Here f € C1({a,b]) and as in this chapter. 
We take gi(x) = e*, go(x) = Ina, gs(x) = V2, ga(x) = 2%, a > 1; & > 0. We 
elaborate on 


P(gi(x), gi(t)) = t= 1,2,3,4. (7.50) 
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Clearly dgi(x) = e*dax, dgo(x) = 4dx, dg3(x) = ayade, dga(z) = az®—'dz. 
We have 


1) 
e! — e@ 
; 5 Q@Xtog, 
e? — e@ 
P(e*,e') = (7.51) 
peice? 
, “@<t<bd, 
er —-e8 
2) 
Int—Ina 27x 
ho-ing® °° 
P(Ina, Int) (7.52) 
Int —Inb Bileg 
Inb—Ina’ meas 
3) 
vi- va a<t<a, 
vb- Va 
(7.53) 
_ | vie r<t<ob, 
vb- Va 
4) 
t* — a® 
», ast<a, 
b® — a® 
(a, t®) (7.54) 
ate <t<b 
— zt 
Furthermore we derive 
1) 
e' — e® 
Sh SES DG: 
e?—e 
|P(e?/4, e*)| (7.55) 
bot 
ey ERD, 
e" = e% qd 
2) 
ee Ree. 
p Inb—Ilna a) 
P (1m —,In ‘) - (7.56) 
qd 
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3) 
vi- va a<t< > 
a vb- Ja q 
PR) os 
vo~ vi Pp SESH: 
vo-Va 4 
4) 
Bree aa 
pe b& — a® ~~ @ 
P(“e)]= oe 
qd b® — 4% 
; <t<ob 
b® — a® 
Next we get 
1) 


b 
1 
[ pe'.e)la = : = {2erlt — (E-a+1)+e"(v-2-1)}, (7.59) 
a e—e qd qd 


2) 
i p(m,me)| a = (1m ey {2 (m2 -1) —a(Ina— 1) 
— (Ina) (2 -a) + (Inb) (»- 2) — b(Inb— o}, (7.60) 
3) 
[eC E len aol tog) 
(7.61) 
4) 
fPG\je-melaa 
(a® $O), ee (7.62) 


Clearly we find 


1 


eo — e@ 


1 
b 
af \Pleel)idt = =~ (240?! — e%(p—ag-+ 4) + (bq p—a)}, (7.63) 
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2) 
if P (2,1ne)} a = (1m (‘)) {2 (m2 = 1) 
— a(Ina— 1)q— (Ina)(p — aq) + (Inb) (bq — p) — b(Inb — val. (7.64) 
3) 
if p(/2.v4) a= cee {peep — (Vat vb)p+ oa, 
(7.65) 
4) 


pe Qa 1 2 —a, atl 
(28 er a 
()| ote eo 


eek ety (=) (att + wen (7.66) 


a+l 


Finally we obtain 


1) 


b 
1 
| («/ Pte d\ = =—— ={2 | get!tar—et(2—a) + (6-2), 
x a ee x 


| (7.67) 
2 
q ‘|p inZ int )| dt) dr 
{of poem) 
= (in ()) f (f vn2ar) ~In(ab) +0402}, (7.68) 
3) 
fel Pedi 
= Boas i gla — (fat v4 EP. (7.69) 
4) 


pe Qa 1 2 | —a atl Qa a 
P| —,t dt = ——— 4 ———~ dr — b 
()@- melee hee cae 


a atl port 
+ (25) + )b. (7.70) 
Applying (7.49) we derive 
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Tp (41, H2) zal f (t)e'dt 


ee eP/4 —~e*(2—a e(b— 
Sas bf DOO deeb a}, (7.71) 


Py(a pa) = (In (2)) . pa 


Sees (m(2)) fof ome Page in(ad) + (a+ 0-2)}, (7.72) 


2) 


3) 


| ° £( 
Pylisb) ~ ae | ven 
j 3/2 + 23/2 
= IF’ loo, [a,b] {5 [aie'ltan+ Ci os a a (J/a+ vis, (7.73) 


~ (Vb= va) 3 
4) (a > 1) 
T (p41, He) — aw ft He. ldt 
\|f ll oo, a,b] 2 —a,at+l 
- tesa (fron) 
+ (= :) (att! + petty — (at + oh (7.74) 


We need to mention 


Proposition 7.15. ({32]). Let f: [a,b] — R be twice differentiable on [a,b]. The 
second derivative f": [a,b] > R is integrable on [a,b]. Let also g: [a,b] > R be of 
bounded variation, such that g(a) 4 g(b). Let z € [a,b]. The kernel P is defined as 
in Proposition 7.11 (7.44). Then we get 


1 b 
f@) = Ey i f(t)dg(t) 


b b 
v Ga) ( i Plat) ( . raat) 


b b 
f i i P(g(z),g(t)) Pla lt), g(ts)) f" (trata. (7.75) 
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Remark 7.16. Let f € C®)({a,b]), g € C((a, b]) and of bounded variation, g(a) 4 
g(b). The rest are as in this chapter. We have the representation formula 


1 a 1 sare 
Tylana) = oy IVES) = ry gay (/ riya) 


. (/. Ge (o(2).a0) i) a) +Rs, (7.76) 
Rs i= J. («// [? (o(2).09) Pla) ata)."(ayua de. {ETD 


Clearly 
Ral SAF lesctasti 


peewee 


Therefore we get 


P(g (2). 0) |P(g(t), an) ee) a) . (7.78) 


1 b 
Ty (Hi, M2) — aoa F(t)dg(t) 


~gacgan (frees) (f,(0f°#(0(2).00)H) a) 


< | Fear 


(f. Gua P(a(2). 0) Pla) atte a), (7.79) 


We further need to use 
Theorem 7.17. ([32]). Let f: [a,b] — R be n-times differentiable on [a,b], n € N. 
The nth derivative f™: [a,b] > R is integrable on [a,b]. Let z € [a,b]. Define the 
kernel 


P(r, s) = (7.80) 


where r,s € [a,b]. Then 


ee eee Te (f(b) — f(a) 
1) = 72, f ronan +S (MOM) 


b b k 
‘ / of P(z, 81) |] P(si, Sina)dsidse + dspy1 
a a t=1 


(k+1)th—integral 


b b n-1 
+ i: + f P(z, 81) II P(s;, 8:41) f((8n)ds1ds2++-dsp. (7.81) 
a a i=1 


68 PROBABILISTIC INEQUALITIES 


-1 0 
Here and later we agree that [[ e=0, [J e=1. 
k=0 i=1 


Remark 7.18. Let f ¢ C/™ ({a, bj), n € N and as in this chapter. Then by Theorem 
1 ([32]) we obtain the representation formula 


T p(t, f2) = , fst yas (Soo) 


(, (f- [ P(Z, n1) TL Plessis asses} 


+ Re, (7.82) 


where 


1 


Re =f ([ f P(2, “JT T] Pees AM(on) ay) dX. (7.83) 


Clearly we have that 


b b n-1 
Rel < IF Mlososs f o(/ ~ [ }p(25)|T 
x a a qd j=l 


We also need 


|P(si, 8i41)|ds1 anaes is) dx. 
(7.84) 


Theorem 7.19. ([32]). Let f: [a,b] — R be n-times differentiable on [a,b], n € N. 
The nth derivative f: [a,b] + R is integrable on [a,b]. Let g: [a,b] + R be of 
bounded variation and g(a) 4 g(b). Let z € [a,b]. The kernel P is defined as in 
Proposition 1 ([32]). Then 

z)= ———"_| 81)dg(s1) + —.——_~ [senda 
mB ) (g(b) — g(a)) . f( 1) g( 1) (g(b) — g(a)) a ( f (s1) g( 1) 


b b k 
‘i (/ see P(g(z), g(s1)) II P(g(s;), g(Si41))dsidsq cee inn) 


n—2 


b b 
+f +f P(o(z),9(s1)) |] PCo(si), 9(si41))f (8n)ds1-++d8n. (7.85) 


Remark 7.20. Let f € C\([a,b]), n € N and g € C({a,b]) and of bounded 
variation, g(a) # g(b). The rest are as in this chapter. Then by Theorem 2 ([32]) 
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we obtain the representation formula 
Ty(n0a) = may | J@iae 
ee ee Oc er 
* @O-ae) 2 (/ ee ») 
(he eo) a) 


k 
TT Pots): on) send + R7, (7.86) 
i=1 


where 


n—-1 
- TT Posi), 9(si41)) fF (sn)dar- + ts) dd. (7.87) 
Clearly we get 


Rel < Union foal fon f P(a(2).0(s)| 


, II |P(9(si), 9(se41))|dsi sya (7.88) 


Next we give some Lg results, a > 1. 


Remark 7.21. We repeat from Remark 7.5 the remainder(7.26), 


Rii= [we G P* (22.) rot) dX(x), 


where f is as in this chapter and f € C1({a,b]), P* is given by (7.24). Let a, 3 > 1: 
+ + 3 =1. Then 


Ral < IF llajoos fa) P (..) locos aX(2) (7.89) 
nd 
, Ral <I 'Ihvou face) > (45.-) Mas a2). (7.90) 


Similarly we obtain for Re (see (7.30)) of Remark 7.6, here f € C')([a, b]) and 
as in this chapter that 


b 
[Ral < I7"lleiou f q (/ 
xX a 


P* (E.t) | |P*(t, Mae dd, (7.91) 
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and 


b 
(Rel < [lf lla ja. i i ( / 
x a 


Similarly we get for R3 (see (7.38)) of Remark 7.8, here f € C)([a, b]) and as 
in this chapter, P as in (7.34), that 


b 
|R3| < I7"llaioo f q ( 
x a 


and 


b 
Ral <IF""llato.g4 | ; ( 
x a 


Next let things be as in Remark 7.12 and R4 as in (7.48). Then 


Ral <IF'laseay f ato) P ((22).00) i az) (7.98) 


P* (2.1) | \|P* (t, atest di. (7.92) 


b 
P(2.s1] | |P(s1, 82)| || P(s2, Dinietts) dX 
(7.93) 


b 
P(E.) | |P(s1, 82)| || P(se, iene) dA. 
(7.94) 


x 
and 
p(x 
Ral < slow f ae)|P(o(4) 9°) | ae. (796) 
xX q(x) oo, [a,b] 
Let things be as in Remark 7.16 and Rs as in (7.77). Then 


b 
IRsl < If llesta.t | o( i 
XxX a 


P(a(2).000) | \Plat0. alos dy (7.97) 
and 


sl <M" hoo f ( | P(a(2).0)| |Past dy. (7.98) 


Let things be as in Remark 7.18 and R¢ as in (7.83). Then 


Rel < [Ff llasa.o I (f' lP(2, »)| 


n-2 
(1 Pts |P(sn—15°)lle,,a,0)¢81 °° aan) d\, (7.99) 
i=1 


Rol < [lf Ula tao iL a(f' |P(2.)| 


(TI |P( 85) 5:44) Yt Sn-15° Mee; [a,b] 451° sya) (7.100) 


and 
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Finally let things be as in Remark 7.20 and R7 as in (7.87). Then 


Ral < IF lta. if xo( : ‘|p (« (HS) .a(s1)) 
, (TL rtatsd.s¢s 


I|P(9(sn—1), 90-))Iha.fa,)481 °- aan) dX(x), (7.101) 


Ral <M lhoas fa af "| (9 (22) on) 
(Tires tsa)! 


II P(g(sn—1), 90) leo. fa,6) 481° im] dX(x). (7.102) 


and 


At the end we simplify and expand results of Remark 7.21 in the following 


Remark 7.22. Here 0 < a < 6 with 1 € [a,6], and 4 € [a,b] a.e. on X. 
i) We simplify (7.90). We notice that 


P (£.+)| - _— oe (7.103) 
here P* is as in (7.24). Then 
IP dw ee Gaeta) 99 
Hence it holds 
Ral < Uflsjoay fae) max (BE — a,b B22) arc (7.105) 


Here f is as in this chapter and f € C'([a, )}). 

ii) Next we simplify (7.96). Take here g strictly increasing and continuous over 
[a,b], e.g. g(x) = e”, Ina, Vz, x* with a > 1. Also f € C1({a,6]) and as in this 
chapter. Then clearly we derive 


Ral < ap [oe max (s (He) ~ 9(a), 90) -¢ (H)) a 


In particular we obtain 
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1) 
[Ra| < —— I, aes I q(a) max (cm ee a oe ') dX(x), (7.107) 
2) 
|Ra| < < [Ethos oT I q(a) max (1 (A) —Ina,Inb—In (42 )) dX(x), 
(7.108) 
3) 


[Ral < aoe ae) max ( | oe — Ja, Vb— Hi) dX(x), (7.109) 


and finally for a > 1 we get 
4) 


/ Qa (ey 
|R4| << ——— LF Haan Lf q(x) max (2 (2) —a™,b* — . =) dX(x). (7.110) 
‘beat Jy q* (2) q°(x) 
iii) Finally we simplify (7.89). 
Let a, 3 > 1 such that rage =1. Also f € C'({a,b]) and as in this chapter. 
We obtain that 


B 
Pe (Et) = (7.111) 
j Pp 
(b— te Gil 
q 
and 
b B (2 a yet of: (b- ee 
x [P q q 
P*(*,t|| dt=-+ —"____—- _? _, 7.112 
[ (! ) Ad ( ) 
Thus 


q(x)’ (B+ 1)q(a)?*> 


Here notice g = 0 a.e. on X. So we need only work with g(x) > 0. So all in (iii) 
make sense. Therefore 


ae) Ir Coole 7 \ aac am ee 


a.e. on X. So we have established that 


Ril < IZ'loon f (\ es dX(x). (7.115) 


lr aa | ae (ee (7.113) 


Chapter 8 


Sharp High Degree Estimation of 
Csiszar’s f-Divergence 


In this chapter are presented various sharp and nearly optimal probabilistic inequal- 
ities giving the high order approximation of Csiszar’s f-divergence between two 
probability measures, which is the most essential and general tool for their com- 
parison. The above are done through Taylor’s formula, generalized Taylor-Widder’s 
formula, an alternative recent expansion formula. Based on these we give many rep- 
resentation formulae of Csiszar’s distance, then we estimate in all directions their 
remainders by using either the norms approach or the modulus of continuity way. 
Most of the last probabilistic estimates are sharp or nearly sharp, attained by basic 
simple functions. This treatment relies on [41]. 


8.1 Background 


Throughout this chapter we use the following. Let f be a convex function from 
(0, +00) into R which is strictly convex at 1 with f(1) = 0. 

Let (X,A,.) be a measure space, where A is a finite or a o-finite measure on 
(X,A). And let 41, “2 be two probability measures on (X,.A) such that pi <A, 
pig <A (absolutely continuous), e.g. A = pi + pe. 

dur 


Denote by p= 4, q= ue the (densities) Radon-Nikodym derivatives of 11, 


Lg with respect to A. Here we suppose that 


(2ee= SE ae. on X and a<1<b. 
qd 


The quantity 


Pytuasna) = faa) (BAR) arc, (8.1) 


was introduced by I. Csiszar in 1967, see [140], and is called f-divergence of the 
probability measures jz; and [l2. 

By Lemma 1.1 of [140], the integral (8.1) is well-defined and T'y(j11, w2) > 0 with 
equality only when jy = #12. Furthermore I'¢(j11, 12) does not depend on the choice 
of \.The concept of f-divergence was introduced first in [139] as a generalization of 
Kullback’s information for discrimination or I-divergence “(generalized entropy )” 
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[244], [243] and of Rényi’s information gain (I-divergence of order a) [306]. In fact 
the I-divergence of order 1 equals 


Du logs ul, [i2). 


The choice f(u) = (u— 1)? produces again a known measure of difference of distri- 
butions that is called y?-divergence. Of course the total variation distance 


hae i box) — a(2)| dd(a) 


is equal to P),_1)(41, #2). Here by assuming f(1) = 0 we can consider T's (p11, 2), 
the f-divergence as a measure of the difference between the probability measures 
M1, 2. 

The f-divergence is in general asymmetric in fz; and jg. But since f is convex 
and strictly convex at 1 so is 


pu) =us (4) (8.2 
and as in [140] we find 


Ty (Ha, Hi) =D yp» (11, Ha). (8.3) 


In Information Theory and Statistics many other divergences are used which are 
special cases of the above general Csiszar f-divergence, e.g. Hellinger distance Dy, 
a-divergence D,, Bhattacharyya distance Dg, Harmonic distance Dyq, Jeffrey’s 
distance Dz, triangular discrimination Da, for all these see, e.g. [102], [152]. 

The problem of finding and estimating the proper distance (or difference or 
discrimination ) of two probability distributions is one of the major ones in Prob- 
ability Theory. The above f-divergence measures in their various forms have been 
also applied to Anthropology, Genetics, Finance, Economics, Political Science, Bi- 
ology, Approximation of Probability distributions, Signal Processing and Pattern 
Recognition. 

A great motivation for this chapter has been the very important monograph on 
the topic by S. Dragomir [152]. 


8.2 Results Based on Taylor’s Formula 


We give the following 

Theorem 8.1. LetO0 <a<1< 0b, f as in this chapter and f € C"([a,b]), n > 1 
with | f(t) — f(1)| be a convex function in t. Let 0 < h < min(1—a,b—1) be 
fixed. Then 


()( (n) fA 
g(a, 2) 0 et fatto aptaa SEED fon alrtan 
(8.4) 
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Here w, is the usual first modulus of continuity, w;(f(,h) := sup{|f (a) — 
FM) ey € [2,8], a —y| Sh}. If fO(L) =0, k= 2,...,n, then 


Py(uisaa) << BED f onp—ght*an (85) 
Inequalities (8.4) and (8.5) when n is even are attained by 
f®:= ae laa a<t<b. (8.6) 
(n+ 1)! 
Example 8.2. The function 
g(x) :=|r-1|"T*, neEN, a>l (8.7) 
is convex and strictly convex at 1, also g™)(1) = 0, k = 0,1,2,...,n. Furthermore 


(g(2))™ =(n+a)(n+a-1)++-(a + le — 11" (sign(e — 1)" 
with 
Ig(z)™ | = (n+ a)(n+a-1)---(a+1)|x - 1] 
being convex. That is g fulfills all the assumptions of Theorem 8.1. 


Proof. Here we have (f(1) = 0) 


p= FB ye ss, (8.8) 
k=1 ? 


i= A Ch ett ({ ven) — FO) dty <> a) (8.9) 


By Lemma 8.1.1 of [20], p. 243 we have that 


Core eC wi(f™, h) 
Pr Oss SS 


where 


lt — 1], (8.10) 


and 
wn (fh) [é- 1]? 


< —————__... 
els ae 


(8.11) 


We observe by (8.8) that 
™ fl) 
af (2) = F094 aoa + lay (8.12) 


true a.e. on X. 

Here gq #0 a.e. on X. Integrating (8.12) against 4 and using (8.11) we derive 
(8.4). Notice that for n even f(”)(t) = |t—1| and f(1) = 0 for all k =0,1,...,n, 
with || f(+||,, = 1. Furthermore, 


wi(f™,h) =h. 
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In both cases in (8.4) and (8.5) their two sides equal 


1 = n 
aa [4 |p —q| van (8.13) 


proving attainability and sharpness of (8.4) and (8.5). 


We continue with the general 


Theorem 8.3. LetO<a<1< 0b, f as in this chapter and f € C”([a,b]), n > 1. 
Suppose that w1(f™,5) <w, where 0 <6 <b—a, w>0. Let x € R and denote by 


bn(z) = a H att (8.14) 


where [-] is the ay . the number. Then 


r (M15 H2) < oP Ol Pol al p- aaa) +w fe adn (4) dy. (8.15) 


k=2 


Inequality (8.15) is sharp, namely it is attained by the function 


fr(t) = weon(t-1), a<t<b, (8.16) 
when n is even. 
Proof. One has that 
nr fk) 
fay= f ao (x —1)* + R(z), (8.17) 
k=1 : 
for all x € [a,b], where 
. n n (x = i 
Rew) = f (FW — FO) ae (8.18) 
From p. 3, see (8) of [29] we obtain 
[R(a)| < wi(f, d)}n(# — 1) < won(w 1). (8.19) 
By (8.17) we get 
P\ _ wf) (p_,\" P 
HG) Le a) ea) om 


k=1 


a.e. on X. Furthermore we find 


mn #(k) 
Dy(s1, 42) = >> u — [ato-atas far (2) dX. (8.21) 


k=2 


Using now (8.19) and taking absolute value over (8.21) we derive (8.15). 
Next we prove the sharpness of (8.15). According to Remark 7.1.3 of [20], 
pp. 210-211 we have that 


=) gn_i(t)dt, x E€Ry, n>1, (8.22) 
0 
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where 
do(t) := a , teR (8.23) 
Clearly 
b*) (x) = (sign(z))"¢n—n(v), k=1,2,...,n, cER (8.24) 
That is 
f‘*) (x) = (sign(a —1))"fr_z(x), 2 ER, k=1,...,n. (8.25) 
Furthermore 


f= 0; B= 0, tosecgns 


Since the function ¢, is even and convex over R, then we have that = is convex Over 
R, n> 1. Also since ¢,, is strictly increasing over R+, then = is strictly increasing 
over [1,00) and strictly decreasing over (—0o, 1]. Thus, fp is strictly convex at 1, 
n> 1. Here since n is even 


le — 1 


ft(a) = w | 
and by Lemma 7.1.1, see p. 208 of [20] we find that 
wi( fx, 6) < w. 
So fe fulfills basically all the assumptions of Theorem 8.3. Clearly then both sides 


of (8.15) equal to 
wf an (4) a, 
xX qd 


establishing that (8.15) is attained. 


Next we see that w 4] can be arbitrarily close approximated by a sequence 


of continuous functions that fulfill the assumptions of Theorem 8.3. More precisely 
ene is a step function over R which takes the values 7 = 1,2,..., respectively, 
over the intervals [1 — (j +1)d, 1 — 6) and (1+ (j — 1)6, 1+ 76] with upward jumps 


of size 1, therefore 
Fn) (4) — jé-1 
Ay =| 


has finitely many discontinuities over [a, b] and is approximated by continuous func- 
tions as follows. 
Here for N € N we define the continuous functions over R, 
Nult| N 2 
kw (1-— if kd < jt} << [k+— 
35 + hw ( x): ith <iel< (kez) A 
fon (t) = (8.26) 


(k + lw, if (i+ x) 5 <|t| <(k+ 16, 
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where k € Z,. Clearly fon(|t|) > 0 with fonw(0) = 0 and fon is increasing over Ry 
and decreasing over R_. Also we define 


fan (t) = of" (f° se (f° fow(tn)atn) ) dt, (8.27) 


fort € R. Clearly fxn € C"(R), is even and convex over R and is strictly increasing 
on Ry. 
According to [20], pp. 218-219 we obtain 


: eh ed 
lim foy(le=1)) = w [SA j , fort € [a,b], NEN, (8.28) 
with 
wi(fow(|-—1|),6) = #1(fon(|-|),46) < w. (8.29) 
Furthermore it invalid that 
lim, fn (It = 1) = fib, tor #0; (8.30) 


Let g(x) = fnn(x —1), « € R, then g)(x) = (sign(a — 1))* fin_nyw (2 — 1), 
zéR,k=1,2,...,n, g(1) = 0 for k = 0,1,...,n, and for all x € R it holds 
g(x) = fow(a —1) since n is even. Furthermore g € C"(R), g convex on R, 
g strictly increasing on [1,+00) and g strictly decreasing on (—oo, 1], therefore 
g is strictly convex at 1. That is g with all the above properties and (8.29), is 
consistent with the assumptions of Theorem 8.3. That is, fr(t) is arbitrarily close 
approximated consistently with the assumptions of Theorem 8.3. 


Next we have 


Corollary 8.4. (to Theorem 8.3). It holds (0 < 6 < b—a) 


al r 
T'¢(u1, M2) < alfa p—q) ay 
ie (n) —n n+1 l-n), in 
wilt as aaa fore antaat sy fat pala 
<a ferp-arta (8.31) 
8(n— 1)! Sx ' 
and 
"fe 
T's (vins #2) < ae i iw ahaa| (8.32) 
k=2 . x 


1 —7n n i —n n 
+ numof 5 [a Ip —q| O+ say fa |p — q| “ah. 
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In particular we have 


1 1 6 
Pyuista) Salt’) 35 f Mo aPars 5 f p—aldr+e}, (633) 
25 D Je 8 
and 
1 
Trina) Sat d){ f w-aats f o-arab. 34) 
xX xX 
Proof. From [20], p. 210, inequality (7.1.18) we have 


el BE cele 


with equality only at « = 0. We obtain that (see (8.18) and (8.19)) 


n+l n n-1 
|R(x)| < wi(f™, 5) Comm wt + aera <w(--:), (8.36) 
with equality at «= 1. 
Also from [20], p. 217, inequality (7.2.9) we get 
Ra] sen 6.) (14 ROE), 
Sw); (8.37) 


with equality at x = 1. Hence by (8.17), (8.18) and (8.36) we find 


ik: fH 1)| id = jr 
== (n) gp 4 


l-n n 2—-n n—-1 
q"\p—aq|" | dg "|p—4 
ays ts Be dd |, (8.38) 


etc. Also by (8.17), (8.18) and (8.37) we derive 


pits H2) oe: fa aaa 


+ wi(f, 6) (/ pour : (o> + foeet =) in) , (8.39) 


etc. 


We give 
Corollary 8.5. (to Theorem 8.3). Suppose here that f™ is of Lipschitz type of 
order a, 0 <a<1, ie. 
wi(f™, 5) < Kd%, K>0, (8.40) 
for ae Then 


i ae uss 
f (ia, ML Bes | fa *(p aytan| + Hl 


ati) 


| q **|p—q|"tdn. 
xX 


i=1 


(8.41) 
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When n is even (8.41) is attained by f*(x) = cla —1|"*%, where c:= K/( 
i)) >0. 


(a+ 
1 


n 


2 


Proof. We have again that 


where 


Riz) = i fo fen) = FA) dn a, (8.42) 


From [29], p. 6 see (29)*, we get 


7s 1 nta 
Ra@jsK EO (8.43) 
II (a+) 
w=1 
(E.g. if f € C"*([a,b]), then K = || f*)||,, and a =1). Then 
Kq|2 -1|"** K 
: R(2)| S$ ine et ia > p—al"**, (8.44) 
a Il (a +i) Il(a+%) 
i=l i=1 
a.e. on X, and 
K 
i q R(2) | dX < _—— | gi "4p — q|"t da. (8.45) 
oe q (a+i) °* 
w=1 


Thus we have established (8.41). 

Clearly (8.41) is attained by f* which is acceptable to Theorem 8.3 requirements. 
Notice that f*(1) = 0, 1 = 0,1,...,n and f*™(x) = K|a — 1|% fulfills (8.40) 
in here by (105) of [29], p. 28. See also from [29], p. 28 at the bottom that 
f*(x) = |R(a)|, ete. 


Corollary 8.6. (to Theorem 8.3). Suppose that 


b—a> | q ‘p’dA—1>0. (8.46) 
x 
Then 


Ps(s41, #42) Serr G ( ‘A r'p'd~1)) { I pada 5h. (8.47) 


Proof. From (8.34) by choosing 


b= f r'p-arar= [ Gg. p OX 1; 
x x 
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Corollary 8.7. (to Theorem 8.3). Suppose (1.46), that is 
b—a> | q ‘p’?dA—1>0, 
x 


lp—q| <1 ae. on X, (8.48) 
and 


MX) < oo. (8.49) 


Tylana) < (x) +5) an (7 (f etvrar—1)). 6.50) 


Proof. From (8.47) and (8.48). 


Then 


Note. In our setting observe that always 


| lg oe ees (8.51) 
x 
and of course 
a< i) gq ‘p*dr < b. (8.52) 
x 
We present 
Corollary 8.8. (to Theorem 8.3). Suppose that 
| -alar>o. (8.53) 
x 
Then 
; b—a 
Py(u1, 2) Suit f’, | Ip—alda |p — q|dA + —— 
Xx x 2 
<30—ajwi (sf pala). (8.54) 
x 


Proof. Notice that 


= a Pp 
62g) 'p-a? = p—al|® 1] < @-a)lp~ al aon 


Hence 
[ te@-oarso-a) | paar. (8.55) 
x x 
We use (8.34), (8.53), (8.55) and we choose 
6:= | lp — q|dA < b—a, (8.56) 
x 
etc. 


Notice that (8.54) is good only for small b—a > 0. 
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Next we derive 


Corollary 8.9. (to Theorem 8.3). Assume that r > 0 and (8.46)- (8.49), respec- 


tively, 
bar rip'ds—1) > 0, 
x 
lp—q| <1 ae. on X, 
and 
A(X) < 00. 
Then 


Pytinna) < AO + 5) (Por(f atvtar—1)). 57 


Proof. Here we choose in (8.34) that 


b:= ae gq ‘(p—q)*dX, (8.58) 


etc. 


Remark 8.10. (on Theorem 8.1). From (8.5) for n = 1 we obtain 


Pyuista) <A) fg tip—g)?aa, (8.59) 


which is attained by cy. Assuming that 
0<hi= | q_'(p— q)?dX < min(1 — a,b—1), (8.60) 
x 
we derive the inequality 


Tylana) <0 (Y's ff etvtar—1)). (8.61) 


The counterpart of the above is 


Proposition 8.11. Let f and the rest as in this chapter, also assume f € C({a, 6]). 
i) Suppose (8.53), 


if lp — q|dA > 0. 
x 
Then 
Renee (1. i, p~ aida). (8.62) 
Xx 
ii) Let r > 0 and 


base | lp —q|dA > 0. (8.63) 
x 
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Then 
Py(uasne) < (142) -on (tr f paler). (8.64) 


Proof. We observe that 


Py(mina) = f af (2) ar= [af (2) ar = f(1) 
[i ()-m0)ae f(a 


Bes, 
(by Lemma 7.1.1 of [20], p. 208) | qui(f,h) |" 7 | dr 
x 


wi (f, h) (f(A) a) <ocen (145 fp alea). 


That is, we obtain 


Fea (1 +f w- odd) | (8.65) 


I 


IA 


IA 


Setting 
h:= ih |p — qldX (8.66) 
x 
we find 


T(t, M2) < 2uri(f,h), 


proving (8.62). 
By setting 


hi= rf lp—qldA, r>0 (8.67) 
x 
we derive 


Ty(aspa) Seei(Fsn) (142), (8.68) 


Also we give 


Proposition 8.12. Let f as in this setting and f is a Lipschitz function of order 
a,O0<a<1l, i.e. there exists K >0 such that 


f(z) -f@ |< Kl2z—yl*, all z,y € [a, 8. (8.69) 
Then 


Py(uayt) <K f @*\p—alead. (8.70) 
x 
Proof. As in the proof of Proposition 8.11 we have 


P(t) < f alr (2) Saul) ans | ox |B a=K [ q'* |p — q|°d. 
xX qd xX qd xX 
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8.3 Results Based on Generalized Taylor—-Widder Formula 
(see also [339], [29]) 


Let f, uo, U1,.--,Un € C"**([a, b]), n € Z,, and let the Wronskians 
Wi(a) := W[uo(x), ui(x),..., ui(x)] : 


uo(@) ur (2) u;(«) 
u(x) — uy (x) uj (x) 

= , t=0,1,...,n, (8.71) 
uy (a) uy?(a) =» u{?(2) 

and suppose that W;(z) are positive over [a,b]. Clearly the functions 
W. 
do(x) := Wo(x) := u(x), o1(#) = ane fo, 
g(a) := ule ee (8.72) 


(Wi-1(x))? ’ 
are positive everywhere on [a,b]. Consider the linear differential operator of order 
i>0 
W{uo(2), u1(z), uae) ui-1(), f(z)] 


Li f(a) := 8.73 
f(z) Wile) (8.73) 
for i=1,...,n+1,where Lo f(x) := f(x) all x € [a,b]. 

Here W[uo(x), ui(z),...,Us—1(2), f(a)] denotes the Wronskian of uo, wi,..., Wi—1, 


f. Notice that fori =1,...,n+1, we obtain 


7 @ se aby AP as, sd. the 
( 


8.74) 


Consider also the functions 


uo(t) ur (t) u(t) 
. up (t) u(t) u;(t) 
i(a,t) := ; 
g(@,t) = ) a ies _ (8.75) 
up (é) up (uy (6) 
uo(@) —ui(@) ss u(x) 
for 1 = 1,2,...,n, where go(z, t) := pain all x,t € [a,b]. Notice that g;(x,t), as a 
function of x, is a linear combination of uo(x), ui(ax),...,ui(x) and, furthermore, 
it holds 
(x,t) = —————__ oi (a 
HO Feb J, OY 
. f ba(ns) [ bi(x4)daidx;_1-+-dry 
t t 
1 ifs 
= —— | $0(s)::: ¢i(s)gi-1(a, s)ds 8.76 
dot) dil) Jy o(s) (s)gi-1(2, 8) ( ) 
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alli =1,2,...,n. Put 


Nn(x,t) = / Gn(a, s)ds. (8.77) 
t 
Example 8.13. Let u;(x) = x’, i=0,1,...,n, be defined on [a,b]. Then L; f(t) = 


fO(t), i=1,...,n+1 and g;(z,t) = coe for all x,t € [a,b] andi =0,1,...,n. 
From [339, Theorem II, p. 138], we obtain 


fla) = So Lif(t)- oe.) + [F snles) Enaafls)ds (8.78) 
all x,t € [a,b], n € cts is the Taylor-Widder formula. Thus we have 
Fla) = So Lif O)ailast) + Enea f(O) Nolet) + Ret), (8.79) 
where ~ 
(0,0) = f° onl, s)(Ensi f(s) - Enaaf(O)ds, 0 € Ze. (8.80) 


Here we use a lot the above concepts. From the above definitions, we observe 
the following 


9n(z, t) > 0, x>t,n=1, 
Gn(a,t) <0, uw <t,n odd, 
Gn(a,t) >0, aw <t,n even, st) 
gn(t, t) = 0, n= 1, go(x, t) >0 
for all x,t € [a, }]. 
Since f(1) = 0 we have 
f(v) = 0 Lif (Vga, 1) + Las f() Nala, 1) + RE(a, 1), (8.82) 
i=1 
where 
Reet) =f gnlt,8)(Enai f(s) — Loe f(1))ds, me Ze, (8.83) 
i 
and 
N(x, 1) =| Gn(a,s)ds, x € [a,b]. (8.84) 
1 
Let 
wi(Lnaifh)<w, O<h<b—-a, w>0. (8.85) 
Then by Theorem 23, p. 18 of [29] we get that 
=a Sil 
Race) sw[ 2 ten] sw(1+ 4) ine), (686) 
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Working in (8.82) we observe that 


f (2) = Yo bs(tan (2.1) + Intif(MaNn (2 1) + qR* (2 1), a.e. on X, 


(8.87) 
That is we have established the representation formula 


p(d1, Hz) = Yuu ) | a9 (2.1) d+ Lnasf(l) f aNn (E1)asn, 


(8.88) 


Ti =| qR* (2.1) di. (8.89) 
xX q 
From (8.86) we get 


rie fope ias fo| (8) 
<w ff Ga p= 1) In, (2 1) a. (8.90) 


Based on the above we have now established 


where 


Theorem 8.14. It holds 


P p(y) )s Linge if fas: (2.2) aa} + marsala (2,1) 09 


coe HY In, (2. 1) ar (8.91) 


wi(Lnosf, ) < Aé®, (8.92) 


Next let 


0<d<b-—a,A>0,0<a<1. Then from Theorem 25, p. 19 of [29] we obtain 
[Rn (x, t)| < Ala — t|"|Nn (a, t)], (8.93) 


for all x,t € [a,b] and n > 0. Hence 


q rR (2.1)| < Aq’*|p—q|*|Nn (2 1) (8.94) 
q q 
a.e. on X, and 
nis f af (E.1)]ar <A f gpa" |™ (E.1)|ar. (8.95) 
x q xX q 


Thus we have proved 
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Theorem 8.15.. It holds 
Pp 
s(t) < Dis ray] f ao: (2.2) aa] + iensarcat] f ante (2.1) a 
x 
4 4 gq: “|p — q|“|Nn (E.1)|ar. (8.96) 
xX q 
From Theorem 26, p. 19 of [29] we have: let 1 € (a,b) and choose 0 < h < 


min(1 — a,b—1), assuming that |Znii f(x) — Ln4if(1)| is convex in x € [a,b] and 
calling w := wi(Lniif,h) we get 


" w 


for all x € [a, 6] and n > 0. Therefore 


af (£.1)| <—\p-d| , (2.1) 
q h q 
nl< = i) pal] (E.1)|a. (8.99) 
h Jx q 


We have established 


, ae. on x. (8.98) 


So that 


Theorem 8.16. Jt holds 
i) 


(drs 42) <b lssi if a9: (2 1) ax] + [one f(I] f Nn( 2.1) aa) 
xX 


7 pal], (E.1)]a. n>0. (8.100) 
h Jx qd 
ii) Choosing and supposing 
O<hi= i. Ip — q| , (2.1) | d\ < min(1 —a,b—1), (8.101) 
x qd 
we derive 
Py(y11,42) < ye sl] f an (2.1) a) + Lone stol| f antn( 2.1) ay 
+0} (tnt, f Ip — al I, (2.1) | in) (8.102) 
xX 


Next by Lemmas 11.1.5, 11.1.6 of [20], p. 345 we have: 
Let a < 1 < b, uo(x) := c > 0, ui (x) concave for x < 1, and ui(x) convex for 
x>1. Let 


G,,(a,1) :=|Gr(a, 1), (8.103) 
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where 


Gea e= if Ee a dss 


all x € [a,b],0<h<b-—a,n>0. Then 


x Gs Ga) Gnas) 
< -. 
G,(a, 1) nn a a er |x — 1] =: 6|a — 1], 


all x € [a,b], n> 1. 
But from Theorem 21, p. 16 of [29] we get 


[Ri(#,1)| < wGn(c, 1), 


all x € [a,b], n € Z,, with wi (Inyif,h) <w,0<h<b-avw>0O. 
Consequently we obtain 


[Rn (@;1)| S$ wa — 1], 


all x € [a,b], n > 1. 
Furthermore it holds 


mis [ afr; (2.1) /asuo f lp—qldA, n>1. 
x qd x 


We have established 
Theorem 8.17. It holds 


Py(aasaa) < Soles] fae (2.1) a 
i=l 


+ [Ens f(D)| if wn (E.1)ea| +00 [ erie es 
xX q xX 


where 


_ Gn(b,1) Gn(a,1) 
0 = Max 5 =f 5 ee ; 
with 1 € (a,b). 
It follows 


Corollary 8.18. (to Theorem 8.17). It holds 


Py(is1, 42) < [Laf(0)| | [ an (2.1) ay + [bof (0) | [om (2.1) ay 


a (tas, iE p— aa) : ( I p— aaa) 
. [ns Se Sit) 
b—-1’ l-a ; 


(8.104) 


(8.105) 


(8.106) 


(8.107) 


(8.108) 


(8.109) 


(8.110) 


(8.111) 
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under the assumption that 


| lp — q|dA > 0. 
x 
From (8.78) we have 
= So Lif(gila, 1) + Ru (2), 


i=l 


where 
Ry(t) = i dn (ts 8) Emaar f(s)ds 


all x € [a,b], n € Zy. Set also 
N*(a,1) := if Gade 
1 


[Rw (2) < |[Lnt1flloo,fa,o]! Nn (#, 1)I- 


Pp x (P 
, Rw (4) A ncatlaiies [; (2.1) 


Then we observe that 


Hence 


and 


| q Rw (2) | dA < [Ena flloo) f q I; (2 1) | dA. 
6 q x q 


We have proved 
Theorem 8.19. Jt holds 


Ts(s1, /2) )< lesa nf on (4, 1) ay 


x [P 
+ |[Ln+1f lloo,ta,0) [a|ns eal 
x q 


Next from (8.113) we obtain 


Rw(2)| < | [ lEnniti(as 


sup |9n(z,°)I. 
s€ [a,b] 


Furthermore it is true that 


[Rw(%)| < |Lntifllafa,oll 9n(Z,“)Iloo,fa,0]- 
Thus 


il Re (2 )| < |[Lntiflla fa |g 


mI 
q oo, [a,b] 


89 


(8.112) 


(8.113) 


(8.114) 


(8.115) 


(8.116) 


(8.117) 


(8.118) 


(8.119) 


(8.120) 
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q|Rw AX < |[Lnsiflli fay [4 
Be 


We have established 


and 


In (2 :) | dd. (8.121) 
q oo, [a,b] 


Theorem 8.20. Jt holds 
T'p(u1, 42) < Ye fl nf q9: (2.1) ay 


ole an” (8.122) 


+ |[Lntif lla fa,oj [alo 


Next we observe that for a, @ > 1 such that + + 3 = 1 we have 


[Rw(z)| < |Lntiflla,faellon(&;-)Ils,fa,e)- (8.123) 
We derive 


Theorem 8.21. Jt holds 


TP p(t1, /2) )< insu iff on (2, 1) ay 


See (/ alo» (2.-)| a), (8.124) 
x q GB, [a,b] 


In the following we use Lemmas 12.1.4 and 12.1.5, p. 368 of [20]. 
Here uo(x) = c > 0, w(x) concave for x < 1 and convex for x > 1. We put 


Nn(2,1) = |Na(2,1)|, (8.125) 
where 
Nn(a, 1) =) 9n(x, 8)ds, (8.126) 
1 
all x € [a, bj], n > 0. Let 1 € (a,b) then 
Nn(a,1) < Ye — 11, (8.127) 
all x € [a,b], n > 1 where 
Nn(b,1) Nn(a,1) 
= —_——_—., ——— >. 12 
w ms te ea (8.128) 


Here we take n even then g,,(x, 8) > 0, all x, s € [a,b]. Clearly then we have 


[Rw(x)| < ||Ln+1f lloo,fa,b] if Gn(, 8)ds| = ||Ln+1flloo,{a,)Nn(a, 1) 
1 


< UlLntifllecfail@ — 1, all # € (a, 8]. (8.129) 


We have proved 
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Theorem 8.22. Here 1 € (a,b), n even. It holds 
f(H1, M2) <¥I Lif(1)| if. 19: (2.1) an 
lEnsafloioa (fi aaa). (8.130) 


Remark 8.23. Here again we use (8.78). Let « > t then by (8.81) gn(x,t) > 0, 
ne Z4. Let Lnaif 2 0 (< 0) then 


ig n(x, 8)Lnzif(s)ds >0 (<0) (8.131) 
and 
f(a) >(S)SoLifOgi(a,t), e>t, ne Zy. (8.132) 
1=0 


Let n be even then g,(x,s) > 0 for any x, s € [a,b]. If Lnyif > 0 (< 0) and a <t, 
then 


i. In(2, 8)Lnsif(s)ds <0 (> 0), (8.133) 
and 
< (>) So Lif aie.) (8.134) 
1=0 


If n is odd, then for x < t by (8.81) we get gn(a,t) <0. If Dn4if > 0 (< 0) and 
x <t, then (8.131) and (8.132) are again true. 


Conclusion. If n is odd and Ln4if > 0 (< 0), then 
2 (Sy Lf Ogee) (8.135) 
i=0 
for any x,t € [a,b], n © Zy. 


In particular, by f(1) =0,a<1l<ba< : <b, a.e. on X we obtain: 
If n is odd and Ln+if > 0 (< 0), then 


> (S) Lif ()gile.1), any x € (a,b) (3.136) 
Hence we have 
f (2) > (<) So Lif agi (2.1) , ae. on X. 
i=1 


Conclusion. We have established a 


Ty (yi; Jia) yh fl if q9: (2.1) dd, (8.137) 


when n is odd and In4if > 0(< 0). 
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8.4 Results Based on an Alternative Expansion Formula 


We need the following basic general result (see also (3.53), p. 89 of [125] and [36]). 
We give our own proof. 
Theorem 8.24. Let f € C”({a, 6]), [a,b] CR, n EN, x, z € [a,b]. Then 


% ie a — z)kt1 +1)” /* Fern 
| fat = SY Oat at fea f™(e)dt. (8.138) 


Proof. From [327], p. 98 we use the generalized integration by parts formula 
[wae = uD — yy) 4 yly(-9) — op (-1" fu oat, (8.139) 


where here 


u(t) = f(t), v(t) = (8.140) 


Notice ul) (t) = &= aa ,k = 1,2,...,n-1 and v(t) = 1, and u)(z) = 0, 


k=1,2,...,n—1. Using (8.139) we have 


Thus 
° Sy x —z)? x—z)* 
fat = saya —2) — fo) ED 4 pry 2D — po EF 
+O Pu arrernae 
n-1 jee (—1)” x oe 
a) f(x Qa taf (t—z)"f™(t)dt, (8.142) 


proving (8.138). 


Here again0<a<1<b, f)=0,a< P<, a.e. on X and all the rest as in 


this chapter and f € C"t!({a,b]), n € N. We plug into (8.138) instead of f, f’ and 
z= 1, we obtain 


teed ev —1)kt} _4)\n pe 
Fle) = Spe sy I I [ e-arre ae, (8.143) 


k=0 


all x € [a,b], see also (2.13), p. 5 of [101]. We call the remainder of (8.143) 


v1(z) = owls / “(t SV Fade: (8.144) 


n! 
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Consequently we have 


1 k+1 
u(8) d ve () (k +1)! pI af (Gay ee 


er 


a.e. on X. 
We have proved our way the following representation formula (see also (2.10), 
p. 4, of [101]). 


Theorem 8.25. All elements involved here as in this chapter and f € C"*"([a, bj), 
nEN,0O<a<1<b. Then 


n-1 
(—1)* | (k+1) (2) -k k+1 
T¢(M1, #2) = f —|)q “(p—q)**' dA F yo, 8.146 
where 
(Lye Pid n ¢(n+1) 
We i= q (¢-1)"f (t)dt ) dy. (8.147) 
n. zr 1 
Next we estimate w2. 
We see that 
wr (a)| < IFO Hewstotl jy 1"*1, all x € (a, 8). (8.148) 
= (n+1)! , ; 
Also 
(n+1) 
wr (ax)| < (Snel 1", all x € [a,d). (8.149) 
ne 


Furthermore for a, 3 > 1: 4 + 4 = 1 we derive 


FO Letaai 


n+4 
lui (x)| < n(n 1B jen—1|"'?, all ze [a,d]. (8.150) 
Then 
hae oo, [a,b] —n n+1 
rer paar, 


p . (n+1) a 
wy (2)| < min of [PO haat) pimps — gin, 


sie a,[a,b] 1-n—4 n+4 1 1 

a B —_ B 1: _— -= 1. 

ninB + hve? ppg Ryo els ee 
(8.151) 

All the above are true a.e. on X. 


We have proved (see also the similar Corollary 1, p. 10, of [101)). 
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Theorem 8.26. All elements are as in this chapter and f € C"*1([a,b]), n EN. 
Here 2 is as in (8.147). Then 


fr [hss tacti ‘| 2 i 
By = "Ip —@Q|"" dA, 
1 m+ D ie Ip—al 
er) iia es | 
v2] <min of 4 By := fa | g-"\p—q|"dd, fora,B>1:-+—=1, 
poe a, [a,b] i {ened 44 
B3 := ————— "~B\lpn —q|""Bdd. 
3 = mat ie I! Ip —4| 
(8.152) 
Also it holds 
n-1 
1 P\ _— : 
T'p(u1, 42) < > (e+ 1)! i. pern(2)g *(p — q)***dA| + min(B1, Bo, Bs). 
k=0 
(8.153) 
Corollary 8.27. (to Theorem 8.26). 
Case of n=1. It holds 
Wy 
Bie ee Il f Ilsesfas6] ‘ q-i(p— q)2d), 
2 x 
: ” 1 1 
|wo| < min of Bo, = Il f Ila, [0,5] [p> q\dy, for a, 3 >1: a + B = 1, 
x 
IF leestasi | ey 144 
B33 = —————— Bip — adX. 
31 = Bane |! Ip— 4 
(8.154) 
Also it holds 
T p(p01, He) < i f' (2) (p— 7) + min(B,1, B21, B31). (8.155) 
x 
Remark 8.28. (to Theorem 8.25). Notice that 
SF f eo urea = SF fF e-meme@ - rem aae 
= dere 
—yryetyy iD" 8.156 
+ Curren se (8.156) 


Therefore from (8.143) we obtain 


n-1 r— k+1 _71)n 
fe) = Sahn SE + Fay 


k=0 


FOF (1)(e —1)"** + a(2), 


(8.157) 
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where 


_ (1) 


n! 


fe Sephora = Fe) aye (8.158) 
1 
Then for 0 < h < b—a we obtain 

eee = fe-ariem@- se aya 


(by Lemma 7.1.1, p. 208, of [20]) 
Al fOrD,h 


ae ii It in | a an 
n. 
(n+1) = 
n! 1 =, 
(n+1) h x 1 x 
= ae i |t — 1|"dt + xf jaya 
ED) hi) © 
7 ae Hf je 1a + all ear} 
1 
We have proved for 0 < h < b—a that 


fry x 
balay] s AE eared ee] feral, (e159) 
1 


all x € |a, }]. 
Next suppose that |f("+)(t) — f+ (1)| is convex in t and 0 < h < min(1— 
a,b—1) then 


wa 


IA 


| | 


aes =| a le — apr) poe) 1) = Fa 
(by Lemma 8.1.1, p. 243, of [20]) 
(n+1) h) 

wis) is le — 1)" ae] . (8.160) 
nih 1 


By 
= 1 m+1 
if It — i1mat] = ee _ , mé-l, 


we get the following 
For 0 < h<b—a we have 


wi( fort, h) fie ne Sah ee uy 


n n+1 h (n+2) ge-161) 


lvs3(x)| < 


and for 0 <h < min(1—a,b—1) and |f("+)(t) — f+(1)| convex in t we obtain 


wi (ferry, h) 


n+2 
CET Fee Hae (8.162) 


|Ws(x)| < 
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néN, all x € [a,b]. The last inequality (8.162) is attained when n is even by 


fQ= — <t<b. (8.163) 


Really f € C”*+!({a,]), it is convex and strictly convex at 1. We get f(*)(1) = 0, 
k =0,1,...,n +1 where f("+)(t) = t — 1 with obviously |f(t)*)| convex and 
wi (fern, h)=h. 
For f the corresponding 
* Pre oe ay 
va(z) = n(n + 2)? 


proving (8.162) attained. Here again 


x € [a,b], (8.164) 


(2) ~ : fen (2) gq *(p—q)*** 


— FD (1)¢-"(p — gr + abs (2). (8.165) 


a.e. on X. 
We have established 


Theorem 8.29. All elements involved here as in this chapter and f € C"*"([a, dj), 
nEN,0O<a<1<b. Then 


_, DF (k+1) [P)\ ke, oy kt] 
Pyle) = Sep ay fe! (E)a (p—q)""'dr 


~ fH) (1) i, q"(p—a)"t da + Ya, (8.166) 
: xX 


a = f avs (2) dd. (8.167) 


Let 0<h < b—a, then from (8.161) we have 


us (2) < wi( fOr hb) fgg lg ip = lh? 
q/\|— n! n+1 h (n + 2) : 


where 


Next we estimate wu. 


q (8.168) 


therefore 


—n = n+1qy 
— fa Ip — al 


+o | eA p= arar} neN. (8.169) 
x 
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When 0 < h < min(1—a,b—1) and | f+) (t) — f+ (1)| convex in t we obtain 


Dp wi (fDi hr) Fe 
jl niga a 
(5) “a(n + 2h 4 Ip—al"™™, (8.170) 


and 


ay uae: +2 
Salat ah * Da 171 
~ nl(n+2)h ee Ip—q|""dX, neEN (8.171) 


For n even the last inequality (8.171) is attained by f by (8.163). 
We have proved 


[al 


Theorem 8.30. All elements involved here as in this chapter and f € C"*"([a, dj), 
nEN,0O<a<1l<band 


u —n-1 +2 
er - —@q\""*dA < b—a. 8.172 
aan fe Ip — 4 <b-a (8.172) 
Then (8.166) is again valid and 
ra] < 
1 n 1 —n— n 
Phe G oy fa ‘|p-al “4an) 
1+ : | gq "|\p—q|"t*dd | . (8.173) 


Also we have 


Theorem 8.31. All elements involved here as in this chapter and f € C"*'({a, bj), 
néN,0<a<1 <b. Furthermore we suppose that |f@t)(t) — f@+0(1)| is 
conver in t and 


1 —n-1 +2 : 
—_—— 2 —q\" dA < 1- —1). 174 
<aey [ote ahaa < min(t — a,5—1) (8.174) 


Then (8.166) is again valid and 


1 
< (n+1) | —n-1 n+2 ; a 
\wa| SW1 (1 , ni(n 2) qd |p q| dX (8 75) 


The last (8.175) is attained by f of (8.163) when n is even. 
When n = 1 we derive 


Corollary 8.32. (to Theorem 8.30). All elements involved here as in this chapter 
and f € C?({a,b]), 0<a<1<b and 


1 
0< =| q *|p—q|?dA < b-a. (8.176) 
3x 


Then 
eee di I (£) (p-gax- £0 a @'p—9)2d\+v4., (8.177) 
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where 
Lf, Pp 
Wa. c= [aves (2) dX, (8.178) 
with 
03,1(2) = -{ (t—1)(f"(t) — f"())dt, x € [a, d). (8.179) 


Furthermore it holds 


1 1 
lesa son ( "5 fp aba) i ([ atytan +1). (8.180) 


Corollary 8.33. (to Theorem 8.31). All elements involved here as in this chapter 
and f € C?({a,b]), 0<a<1<b. Furthermore we suppose that |f’(t) — f’(1)| is 
conver in t and 


1 
0< 5 | g ?lp — qdd < min(1 — a,b— 1). (8.181) 
x 
Then again (8.177) is valid and 
1 
lai] Swi ( i =| q *|p- aida) : (8.182) 
x 


Remark 8.34 (to Theorem 8.29). Here all as in Theorem 8.29 are valid especially 
the conclusions (8.166) and (8.167). Additionally assume that 


ra Wisk ae, (8.183) 
K>0,0<a<1,allz,y € [a,b] with (x -—1)(y—1) > 0. Then 


K| f* K 
Say, t—1 ame) ———_ mre | nta+l1 8.184 
wal < | It 1 oer rr 
all x € [a,b], n € N. Therefore it holds 
‘ as re nta+l 
¢) |= Motard _ ec. on X. 1 
a (2)|< aay lp — q| » ae. on (8.185) 


We have established 


Theorem 8.35. All elements involved here as in this chapter and f € C"*"([a, dj), 
nEN,0<a<1<b. Additionally assume (8.183), 


aE ay) ele 
K>0,0<a<1, all z,y € [a,b] with (2x -—1)(y—1) >0. Then we get (8.166), 


n-1 1 


Py (Ha, M2) = 
k=0 


(=1)” n+1 —n n+1 
Grp ay fa (p—q)"ttd\ + Wa. 
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Here we obtain that 


ya] < sceae I gq |p—q|?tottdy. (8.186) 
Inequality (8.186) is attained when n is even by 
f@=dr-1"°",,. wea, o| (8.187) 
where 
a (8.188) 
HL Meee 1— 7) 


Proof of sharpness of (8.186). Here f* € C"*1({a,b]), is convex and strictly 
convex at x= 1, f*(1) =0. 
We find that 
fr) (2) = K|ax — 1|* sign (x — 1) (8.189) 
with f*(1) = 0, k =1,2,...,n +1. Furthermore f*(*» clearly fulfills (8.183), 
see in p. 28 of [29] inequality (105). 
Additionally we obtain the corresponding 
K\x —1)rrert 
w3(2) — n(n tat 1) ry 
Then the corresponding 


Pp kK —n-a nt+at+l 
2: =a ee = dd, 8.191 
o [afes(2) CEU poo ( ) 


proving that (8.186) is attained. 


x € [a,b]. (8.190) 


Finally we present 


Corollary 8.36 (to Theorem 8.35). All elements involved here as in this chapter 
and f € C?({a,b]), O0<a<1<b. Additionally assume that 


LP" (x) — fy) < Kx — yl*, (8.192) 
K>0,0<a<1,allz,y € [a,b] with (x —1)(y—1) > 0. Then we get (8.177), 
iW 1 
Ds(t1, 2) = Vis (2) (p — q)dv — FD [pad + as 


where 
with 


It holds that 


K 
\thaji] < ary fee git dy. (8.193) 
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Chapter 9 


Csiszar’s f-Divergence as a Measure of 
Dependence 


In this chapter the Csiszar’s f-Divergence or Csiszar’s Discrimination, the most 
general distance among probability measures, is established as the most general 
measure of Dependence between two random variables which is a very important 
aspect of stochastics. Many estimates of it are given, leading to optimal or nearly 
optimal probabilistic inequalities. That is we are comparing and connecting this 
general measure of dependence to known other specific entities by involving basic 
parameters of our setting. This treatment relies on [39]. 


9.1 Background 


Throughout this chapter we use the following. Let f by a convex function from 
(0,-+00) into R which is strictly convex at 1 with f(1) = 0. Let (R?,B?,.) be the 
measure space, where \ is the product Lebesgue measure on (R?, B?) with B being 
the Borel o-field. And let X,Y: 9 — R be random variables on the probability 
space (0, P). Consider the probability distributions wxy and px x py on R?, 
where xy, “x, Wy stand for the joint distribution of X and Y and their marginal 
distributions, respectively. 

Here we assume as existing the following probability density functions, the joint 
pdf of uxy to be t(x,y), z,y € R, the pdf of tx to be p(x) and the pdf of py to 
be q(y). Clearly ux x py has pdf p(a)q(y). Here we further assume that 0 <a < 
pl ae. on R*? anda<1< 0b. 

The quantity 


Pyluxrsax xar) =f oat (FEM) ane, 4) 


is the Csiszar’s distance or f-divergence between uxy and [x X py. 

Ty is the most general measure for comparing probability measures and it was 
first introduced by I. Csiszar in 1963, [139], see also [140], [152], [244], [243], [305], 
[306], and [101], [102], [125], [126], [37], [42], [43], [41]. 

In Information Theory and Statistics many various divergences are used which 
are special cases of the above I’y divergence. ['f has many applications also in 


101 


102 PROBABILISTIC INEQUALITIES 


Anthropology, Genetics, Finance, Economics, Political Science, Biology, Approx- 
imation of Probability Distributions, Signal Processing and Pattern Recognition. 
Here X,Y are less dependent the closer the distributions uxy and ux X py are, 
thus Ty(uxy, [tx X fy) can be considered as a measure of dependence of X and Y. 
For f(u) = wlog, u we obtain the mutual information of X, and Y, 
(X,Y) = I(uxy || ux x wy) =Tutog, u(uxy, Ux X py), 
see [140]. For f(u) = (u— 1)? we get the mean square contingency : 
y?(X,Y) = Pu—1)2(Uxy, EX x Ly), 


see [305]. In the last we need uxy < [ux X fy, where < denotes absolute continuity, 
but to cover the case of uxy K fx X py we set y?(X,Y) = +00, then the last 
formula is always valid. 

Clearly here uxy, bx X py <A, also T¢(Uxy, bx X py) > 0 with equality only 
when xy = lx X My, ie. when X,Y are independent r.v.’s. In this chapter we 
present applications of author’s articles ([37]), ([42]), ([43]), ([41]). 


9.2 Results 


Part I. Here we apply results from [37]. We present the following. 
Theorem 9.1. Let f € C1({a,b]). Then 


Dy(uxy, ux x by) SIF" lloo,fa,) is \t(x,y) — p(@)aty)|dX(z,y). (9.2) 


Inequality (9.2) is sharp. The optimal function is f*(y) = |y—1|%, a > 1 under 
the condition, either 


i) 
max(b—1,l—a) <1 and  C :=esssup(pq) < +00, (9.3) 
or 
ii) 


lt —pq|<pq<1, ae. onR?. (9.4) 


Proof. Based on Theorem 1 of [37]. 


Next we give the more general 


Theorem 9.2. Let f € C"*1({a, b]), n € N, such that f (1) =0, k = 0,2,3,...,n 
Then 


Ts(uxy, bx X seas 


(n+1) 
LE Moo ta,0) wt | ola ay)" lta.) = (way) I"**dX(w,y). (9.5) 


+0)! 
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Inequality (9.5) is sharp. The optimal function is f(y) := |y—1\"t®, a > 1 under 
the condition (i) or (tt) of Theorem 9.1. 


Proof. As in Theorem 2 of [37]. 


As a special case we have 


Corollary 9.3 (to Theorem 9.2). Let f € C?([a,b]), such that f(1) =0. Then 


(2) 
Pyuxysux xp) <A Ff (p(2))-'(a(y))- tle, 9) ~plw)aly))? ANC. 

(9.6) 
Inequality (9.6) is sharp. The optimal function is f(y) := ly —1|!*°, a > 1 under 
the condition (i) or (i) of Theorem 9.1. 


Proof. By Corollary 1 of [37]. 


Next we connect Csiszar’s f-divergence to the usual first modulus of continuity 


Wy. 


Theorem 9.4. Assume that 


o<hi= f lean) —lealu)|AXa,y) <min(1—a,b—1). (9.7) 
Then 
Dy(uxy, bx X by) Swil(f,h). (9.8) 
Inequality (9.8) is sharp, namely it is attained by 
f*(@) = |a— I]. 


Proof. Based on Theorem 3 of [37]. 


Part II. Here we apply results from [42]. But first we need some basic background 
from [26], [122]. Let v > 0, n := [v] anda =v—n(0<a< 1). Let z,29 € [a,b] CR 
such that x > x0, where Zo is fixed. Let f € C({a,6]) and define 


x 


(72° f\(e) =a i (x — s)""1f(s)ds, a <a<b, (9.9) 


(0) 


the generalized Riemann-Liouville integral, where I stands for the gamma function. 
We define the subspace 


Cxo([a, 0)) = {f € C"([a, 6): Jy2,D"f € C*([x0, b))}- (9.10) 


For f € CY, ([a, b]) we define the generalized v-fractional derivative off over [ao, b] 
as 


Dy f= DIF) (v = =) (9.11) 
a dx 


We present the following. 
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Theorem 9.5. Leta<b,1<v<2, fe Cr([a,b]), andO<a< a <b, ae. on 
R?. Then 

Ty(uxy, ux X py) 


Be Ttetetl f (o(a))' (aly)! bles) — aploialy))” AXay) (0.12) 


Proof. It follows from Theorem 2 of [42]. 


The counterpart of the previous result comes next. 


Theorem 9.6. Let a < b, v > 2, n:= [vy], f € CX%([a,d]), fO(a) = 0, i = 


0,1,...,»-1, anda< ae <b, a.e. on R?. Then 
p(x)a(y) 


Ty(uxy, ux X by) 


< eT Lote) *(ala)) teas) — ante )aly))"aXlasy) (9.3) 


Proof. Based on Theorem 3 of [42]. 


Next we give an Lg estimate. 


Theorem 9.7. Leta <b, v > 1, n:= [vy], f € C%([a,d]), f(a) = 0, i = 


0,1,...,n—1, anda< oI <b, ae. on R?. Let &,8>1: z+g= 1. Then 
| De fille, {a,0) 
Tr pam i 
HMbxvi be WS Toy 1) + I 
: [ (r@)a(y))?-"# (t(@, 9) — ap(a)q(y))" FF aXa,y). (9-14) 


Proof. Based on Theorem 4 of [42]. 


It follows an L., estimate. 


Theorem 9.8. Leta < b, v > 1, n:= [vy], f € CX([a,b]), f(a) = 0, i = 
0,1,...,»-1, anda< oI <b, a.e. on R?. Then 


ehtedl [ a)a(y))®*(t(e.9) ~ar(a)aly))? *aa(esy) 


(9.15) 


Ts(uxy, ex X py) < 


Proof. Based on Theorem 5 of [42]. 


We continue with 


Theorem 9.9. Let f € C'([a,b]), a#b. Then 


1 b 
Pyluxysnx x nv) So | fle)de 
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Proof. Based on Theorem 6 of [42]. 
Furthermore we obtain 


Theorem 9.10. Let n odd and f € C"+"({a,b]), such that ft) > 0 (< 0), 


0O<a< na <b, a.e. on R?. Then 


b 
Ppluxysux x nv) <(2)p—— [flea (9.17) 


n 


os aa (>: [. i? (=) (pq)! (bpq — t)°— (apg — ota] 
i=1 " \k=0 


Proof. Based on Theorem 7 of [42]. 


Part III. Here we apply results from [43]. 
We start with 


Theorem 9.11. Let f,g € C1([a,b]) where f as in this chapter, g' #0 over [a,b]. 


Then 
oo, [a,b] c plz)a(y) ) (<) = w(t) dX(a, y). 
(9.18) 


f' 
gy 


Ty(uxy, ex X by) < | 


Proof. Based on Theorem 1 of [43]. 
We give 


Examples 9.12 (to Theorem 9.11). 
1) Let g(x) = 4. Then 


Pyluxy six x wr) € [Fleas f PL2)IY) Io», 4) — ple)a(y)|drle,y). (9.19) 


R2 t(a,y) 
2. Let g(x) = e*. Then 


t(w,y) 
er@ay) —e 


dX(a,y). (9.20) 


Pyuxyspx X pr) < lle" Fllcsoss f wCe)aty) 
3) Let g(x) = e~*. Then 
__t(@,y) ~ 
Pyuxy sx X tr) <lle*F'loojaay f ola)aty) le F57 — e-4| aX(v,y)- (9.21 
R 


4) Let g(x) = éna. Then 


Pyuxvstx x tv) < laf" lojoa f. w(a)ay) fin (SEH )| ar(ery). (9.2) 


5) Let g(x) = vén x, with a > e~!. Then 


fi 
———<—____. t 
1+ ln X ||. 14,0] I. (2,4) 


tn (2) | dX(x,y). (9.23) 


Ts(uxy, ex X py) < | 
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6) Let g(x) = /x. Then 


Dys(uxy, bx X My) < 2I|VeF"llo0,{a,5] ie VP(x)a(y) lV ta, y) — VP(@)a(y)|dA(a, y). 
(9.24) 
7) Let g(x) = «*,a>1. Then 


Tp(uxy, x X by) 
f [ e@awy*lee.9)* — We)ay))"!4Ax,y)- (9.25) 
oo, [a,b] YR? 


1 
<a 
a 


zol 
Next we give 

Theorem 9.13. Let f € C'([a,b]), a#b. Then 

If’ Iloo, {a,b} 


ee HS Moo 1a;0) 
~  (b-a) 


(—*) —(a+b) 


+ [oem oe nares]. (9.26) 
R2 


1 b 
Ts(uxy, bx X py) — —/ f(y)dy 


Proof. Based on Theorem 2 of [43]. 
It follows 


Theorem 9.14. Let f € C?)({a,b]), a4 b. Then 
b = x 7 
Tp(uxy, x X by) — —/ f(s)ds — (o) (1 = =") 


(9.27) 


a 


é ty) (atb))2 : 
Pr tmal freon |SREEM 3) avon a} 


2 


Proof. Based on (33) of [43]. 


Working more generally we have 


Theorem 9.15. Let f € C'({a,b]) and g € C({a,b]) of bounded variation, g(a) 4 
g(b). Then 


b 
Py(uxy, ux X by) — aoe | F(s)dg(s) 


x I"tmwat f pq (/ P («(=).a(s)) 


i) a}, (9.28) 


where 
g(s) — g(a) 
go) gay) “S95 * 
P(g(z), 9(8)) = =< (9.29) 
g(s) = 9 
gid) gay) SS 
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Proof. By Theorem 5 of [43]. 
We give 
Examples 9.16 (to Theorem 9.15). Let f € C'({a,0]) and g(x) = e”, nz, V2, 
x*,a>1;x2>0. Then by (71)-(74) of [43] we obtain 
1) 


Ty(uxy, Mx X by) sta fs f(s)e*ds 


If" 0,fa,8] eae et a) +e 
Seay i d\ — e*(2—a) + e%(b 2}, (9.30) 


Ts(uxy, Ux X py) — G (2 )) {2 F(6) 
< | f'lloo.ta.t (on(2)) fo [ten er (9.31) 


3) 


2) 


. 7 i ” f(s) 
Ty (uxy, wx x ply) awe! We ds 


(22 48/ 
2 Wlesites {5 [eo erans eae) = Das vi}, (9.32) 


* (Vb— Va) 
4) (a >1) 
Tp(pexy, x X by) mw fH f(s)s°~ 1s 
S 4 7 co 
+ (- - :) (att? 4 petty — (a2 4 yh (9.33) 


We continue with 


Theorem 9.17. Let f € C)({a,b]), g € C([a, b]) and of bounded variation, g(a) 4 
g(b). Then 


Ty(uxy, x X by) — cata [I )dg(s 


S oeran ( i ba 7 ( I, (v i ” (=) .)) ts) a) 
S [lle ( iM (( lf \P(o(S) 90) Plae).obe dee) a). 


(9.34) 


108 PROBABILISTIC INEQUALITIES 


Proof. Apply (79) of [43]. 


Remark 9.18. Next we define 


* _fs-a, sé[a,z] 
P*(z,8):= {: ah Selb. (9.35) 


Let f € C1({a, b]). Then by (25) of [43] we get the representation 


b 
Dy(uxy, ex X py) = — (/ Heys +R.) : 


where according to (26) of [43] we have 


_ oe tay) ’ 
Ri = a P(x)a(y) (/ a (Svs) f ov) d(x, ¥). (9.36) 


Let again f € C1({a,b]), g € C({a,b]) and of bounded variation, g(a) 4 g(b). Then 
by (47) of [43] we get the representation 


b 
Pyluxv ix Xi) = oy ff Fleldals)+Ra, (9.37) 


where by (48) of [43] we obtain 


= f. a ([2(0 ~) a) sey) dy. (9.38) 


Let &,@>1: $+ 7 = 1, then by (89) of [43] we have 
t(x, y) 
Ral < Wf" llata, [roa P | dX(x,y), 9.39 
1 | | ,[a,b] ies ( ) ( ) p()q(y) Ae ( ) ( ) 


and by (90) of [43] we get 


Ral <I" Ihyox [wear (SE#..)| axe. 2.40) 


P(Z)a(Y)” J Neo, fa,0) 
Furthermore by (95) and (96) of [43] we find 
dX(x,y), (9.41) 


t(a, y) 
r(o (Ss) gt ) pee 


t(a, y) bs 
(o(a54%) 00) aNEs > AUS) 


[a,b] 


Ral <[lf'llato i p()q(u) 


R2 


and 


eS 


Ral < IF loo f  wlaaty 


Remark 9.19. Here 0 <a <b with 1 € [a,}], and = € [a, 8] a.c. on R?. 
(i) By (105) of [43] for f € C({a, b]) we obtain 


Ral < Asian fa)a(y) max (FEB a,b AED) ariey). (0.43) 


p(x)a(y) p(x)a(y) 
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(ii) Let g be strictly increasing and continuous over |a, }], e.g., g(x) = e”, fn az, 
Jax, x* with a > 1, and x > 0 whenever is needed. Also f € C1({a,b]) and as in 
this chapter. Then by (106) of [43] we find 


IF’ Ila,f0,0) i fase t(a,y) 
Ral = cay —btayy J Podeted ma (. (wenn) 


Jit _ (te) . 
g(a), 9(0) ~9 (ae) a 1). (9.44) 
In particular via (107)—(110) of [43] we obtain 
1) 
|[Ra| < en a p(x)q(y) max (crit — ee? — emteyaty ) dX(a,y), (9.45) 
2) 
Flt ta,0) eg(y) max{ én ( £62¥) 
Real < abt) fy MOM ¢ Corny) 
—flna,lnb— kn (2, ¥) x 
éna,lnb— (ee) Ja .y); (9.46) 
3) 
Il f’Il1,[a,0) . dues U(x, y) 
OO eae heya) 
_ ve, | ew g 
VO et a 7 ake 
and finally for a > 1 we obtain 
4) 

IF’ ha ta,0) 4 Was Ee) — a&. pt — i* (x54) x 
al < a eae feel) mex ( Torte — 8" — Gets ae 
At last | 

iii) Let @, 8 > 1 such that 3 + 3 = 1, then by (115) of [43] we have 
Ral < If’ lafa.0) 
a/ (t(x,y) — ap(x)a(y))P+* + (bp(x)a(y) — t(a, y))P** 7 
i ( (8+ Nypleaty) pe 
(9.49) 


Part IV. Here we apply results from [41]. 
We start with 
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Theorem 9.20. Let0<a<1<b, f as in this chapter and f € C”({a,b]), n> 1 
with |f™(s) — f™(1)| be a convex function in s. Let 0 <h < min(1—a,b—1) be 
fixed. Then 


(k) ( 
Typ(uxy, ux x py Dy as vo 2) a Loon )'“*(t — pq)*da 


(n) 
ety (oa)-"\t—pal4dd. (9.50) 
R2 


(n+ 1)!h 
Here w, is the usual first modulus of continuity. If f(1) =0, k =2,...,n, then 
ee - 41 

T < — > "It — pq|"*- dy. 9.51 
pluxy x X py) < CEST (PD) |t — pq| (9.51) 

Inequalities (9.50) and (9.51) when n is even are attained by 

5 |s = perk 

= ——_—_ <s<ob. 9.52 
ils)= Sass (9.52) 


Proof. Apply Theorem 1 of [41]. 


We continue with the general 


Theorem 9.21. Let0 <<a<1<b, f as in this chapter and f € C"([a,b]), n> 1. 
Assume that wi(f™, 6) <w, where0<d<b-—a,w>0. Let x € R and denote by 


dn (a) = s [=| —— dss (9.53) 


where [-] is the ceiling of the manor: aie 


(k) 
T's(uxy, bx X My) > vo ul UY fo )I-*(t — pq) May 


4. 7 padn € ot dd. (9.54) 


Inequality (9.54) is sharp, namely it is attained by the function 
fr(s) := won(s—1), a<s<b, (9.55) 


when n is even. 


Proof. Apply Theorem 2 of [41]. 
It follows 


Corollary 9.22 (to Theorem 9.21). It holds (0 < 6 < b—a) 
(k) (1 

Pw | (pq)'*(t — pq)*dar 
BJs 


Ty(uxy, bx X by) < ye 
k=2 
+ wi(f, 6) —an | (pq)~"|t — pq|”**dr 
ACES EN Pe 
1 
5 =| (pq)'—"|t — pql"dr 
Nn: R2 


ave 2-n n-1 
7 soon [0 |é — pal al (9.56) 
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and 


n 


FOL 
Tp(uxy, x X py) Sa ul af (pq) *(t - pa) 


n! 


+ ime [o0'"e—palnan 


1 —n n+l 
+ pea ips flea eal a}. (9.57 


In particular we have 


1 
Tp(uxy, bx x py) < TTX [oot — pq)?dd 
1 6 
=f al |t — pq|dd + sh. (9.58) 
2 R2 8 


T's(uxy, ux X by) < nto f. |t — pq|dX + x [000° oaran} (9.59) 


and 


Proof. By Corollary 1 of [41]. 


Also we have 


Corollary 9.23 (to Theorem 9.21). Assume here that f\™ is of Lipschitz type of 
order a, O<a<1, ie. 


wi(f™,5) < Kd", K>0, (9.60) 
for any0 <6 <b-—a. Then 


nr | fk) 
Typ(pexy, ex X by) < S- lf) 


| (pq)'*(t — pq)*dr 
R2 


K 

—— | (pq)!-"-4|t — pql"*@a. (9.61) 
II (a +4) “®” 

4I=1 


When n is even (9.61) is attained by f*(x) = clx —1|"**, where c:= K/([] (a+ 
i=i 
i)) > 0. 


Proof. Application of Corollary 2 of [41]. 


Next comes 


Corollary 9.24 (to Theorem 9.21). Assume that 


b=a> , (pq) 't?d\ —1>0. (9.62) 
R2 
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then 
1 
Ty(uxy, px X py) Sw @ (| (pq) 't?dA — 1)) { |t — pgldd + 5} 
R2 R2 
(9.63) 

Proof. Application of Corollary 3 of [41]. 
Corollary 9.25 (to Theorem 9.21). Assume that 

| |t — pq|dX > 0. (9.64) 

R2 


Then 


T'p(uxy, x X by) )sui(s sf te patar) { f i a 
- 30s iy ae It pala) (9.65) 


Proof. Application of Corollary 5 of [41]. 


We present 


Proposition 9.26. Let f as in this chapter and f € C({a, }]). 
i) Assume that 


| |t — pq|dA > 0. (9.66) 
R2 

Then 

Py (uxy, Mx X by) < 204 Ga 1 pala (9.67) 
R 
ii) Let r > 0 and 
b-azr | |t-paldr>o. (9.68) 
R2 
Then 
1 
Ty(uxy, ex X py) < (1 BLE a (4+ [ I~ pala : (9.69) 
R2 


Proof. Application of Proposition 1 of [41]. 
Also we give 


Proposition 9.27. Let f as in this setting and f is a Lipschitz function of order 
a,0O<a< 1, ie., there exists K >0 such that 


f(z) -f@|< Kle—yl*, all z,y € [a, 9). (9.70) 
Then 


Py(uxy sux x py) SK f(g)!" palaa. (9.71) 
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Proof. By Proposition 2 of [41]. 
Next we present some alternative type of results. We start with 


Theorem 9.28. Let f € C"*1({a,b]), n € N, then we get the representation 


n-1 
7 (=1)* (k-+1) (= 
Peluxy wxxnr) =) om fe Pq 


where 
a "i s—1)” (n+1) s)ds 
[09 (/ ae Ni ota yn) 


Proof. By Theorem 12 of [41]. 
Next we estimate w2. We give 


Theorem 9.29. Let f € C"t!({a,b]), n EN. Then 
[ee es: a,b —n n 
= 4 f (pq) "|t— pal"** dd, 


eas (n+ 1)! 


Rigas es ae 
= | (pq)" |t — pq|" dd, 
R2 


By . | 
ni 


|e] < min of 
1 
and fora, B>1: = 
a 


= f° eta fn oles een 
Bs'= Tne + Due [09 a\t—pq|"*P dd. 


Also it holds 
n-1 
Tp(pexy, ex x Ly) < Sis +d! 
k=0 , 


+ min(B,, Bo, B3). 


Proof. See Theorem 13 of [41]. 


We have 
Corollary 9.30 (to Theorem 9.29). Case of n= 1. It holds 


od, LF lsesteacti at 2 
Bio (pq) ~(t — pq) dd, 
R2 


Bi ea if It — paqlda, 
R2 


Ld 


-=1, 


|v2| < min of 
and fora, B>1,=+ 
a fp 


IF lle,fa,0) 16 eee 
-— oe = B 
B31: CES [09 |t — pq| "dd. 


| ar (=) (pq) *(t — pq)*t hdr 
R2 Pq 


i ) (pa) F(¢—pq)** debe, (9.72) 


(9.73) 


(9.74) 


(9.75) 


(9.76) 
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Also it holds 


Ty(uxy, bx X py) <|ff (=) (t - pa) +min(B,1,B21,B31). (9.77) 


Proof. See Corollary 8 of [41]. 
We further present 


Theorem 9.31. Let f € ide n EN, then we get the representation 


a fs ae. (=) (pq) *(t — pa)***dr 


Fay Loa" = pa)" + 4, (9.78) 


T(uxy, x X py) oe (s+ 
iy 


k=0 


tii 


m+n! 
where 
Y= im pqs (=) dX, (9.79) 
with 
3 (x) = —s [o SOs rr (Das: (9.80) 


Proof. Based on Theorem 14 of [41]. 


Next we present estimations of qq. 


Theorem 9.32. Let f € C"*!({a,b]), n € N, and 


i —n-1 42 
—_—_—_ ie — pq|"'“dr < b—-a. 81 
0< ay [00 tt pala < b~ a (9.81) 


Then (9.78) is again valid and 


i 
wal < Sarl fr, (pq)~"—"|t — pq|"t7dr 
ni R2 


1 
5 (1 + — | (pq) |t- pal*"dA). (9.82) 
n+1 R2 


n+2 


Proof. By Theorem 15 of [41]. 


Also we have 


Theorem 9.33. Let f € C"*+([a, b]), n EN, and |f("t)(s) — f"+Y(1)| is convex 
in s and 


1 —n-1 n+2 : 
0< eH fa lt — pql"*"dd < min(1 — a,b 1) (9.83) 


Then (9.78) is again valid and 


[bal Sur (ae ey [oa tie mal"*00) (9.84) 


Csiszar’s f-Divergence as a Measure of Dependence 115 


The last (9.84) is attained by 
a<s<b (9.85) 


when n is even. 


Proof. By Theorem 16 of [41]. 


When n = 1 we obtain 


Corollary 9.34 (to Theorem 9.32). Let f € C?({a,b]) and 


0< ae (pq)? |t — pq|?dd < b—a. (9.86) 

Then 
Ty(uxy, px X py) = is (=)e — pq)d\ — j “ [0a paar W, 
(9.87) 

where 
Var =| pqs, (=) dX, (9.88) 

R2 Pq 
with 

Tee --f @—1GF"@) 7" O))day @ E14 8. (9.89) 


Furthermore it holds 


1 1 
jesal son (1.5 [oe paar) 5(f oatartr). 90) 


Proof. See Corollary 9 of [41]. 


Also we have 


Corollary 9.35 (to Theorem 9.33). Let f € C?({a,b]), and |f@(s) — f(1)| is 


conver in s and 


1 
0< >| (pq)~*|t — pq|?dd\ < min(1 — a,b — 1). (9.91) 
R2 
Then again (9.87) is true and 
1 
Iaal Sw G 5 [ook - pala . (9.92) 
R 


Proof. See Corollary 10 of [41]. 


The last main result of the chapter follows. 


Theorem 9.36. Let f € C"*!([a,b]), n € N. Additionally assume that 
Pera Make =o" (9.93) 
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K>0,0<a<1, all x,y € [a,b] with (x —1)(y—1) > 0. Then 


n-1 _41)k 
rim 7 pom (=) (oa)*(e— pay 


i=, (k+1)! pq 
=1)" (nts) | - +1 
eat! "(t—pq)"t'd\+ v4. (9.94 
ge CEaNE (1) PD) (t — pq) +4. (9.94) 
Here we find that 
K 
a Se Ti Append x 9.95 
[Wal S AG eco) [09 |t — pal (9.95) 
Inequality (9.95) is attained when n is even by 
f*(x) =Aa—-1yrtet!, ee [a,b] (9.96) 
where 
K 
aio ages (9.97) 
I] fm+e+1—3) 
j=0 


Proof. By Theorem 17 of [41]. 
Finally we give 


Corollary 9.37 (to Theorem 9.36). Let f € C?({a,b]). Additionally assume that 


lf" (@) — f" @ |< Kle-yl*, K>0,0<a<1, all 2,y € [a,}] (9.98) 
with (2 —1)(y—1) >0. Then 
Ty(uxy, ux X py) = if r(=)e — pq)dd — “ I. (pq) *(t — pq)?dd + Wa. 
(9.99) 
Here 
wai = | pqs, (=) dX (9.100) 
R2 Pq 
with 
voate)=— f(s 1") — F"))ds, we [ad]. (9.101) 
It holds that 
\thai| < (a+) [eo — pq|**7 dd. (9.102) 


Proof. See Corollary 11 of [41]. 


Chapter 10 


Optimal Estimation of Discrete Csiszar 
f-Divergence 


In this chapter are established a number of sharp and nearly optimal discrete proba- 
bilistic inequalities giving various types of order of approximation of Discrete Csiszar 
f-divergence between two discrete probability measures. This distance is the most 
essential and general tool for the comparison of these measures. We give also var- 
ious types of representation of this discrete Csiszar distance and then we estimate 
tight their remainders, that is leading to very close discrete probabilistic inequali- 
ties involving various norms. We give also plenty of interesting applications. This 
treatment follows [45]. 


10.1 Background 


The basic general background here has as follows. 

Let f by a convex function from (0,-+00) into R which is strictly convex at 1 
with f(1) =0. 

Let (X,A, 2) be a measure space, where X is a finite or a o-finite measure on 
(X, A). Let 1, 2 be two probability measures on (X, A) such that 1 << A, wo <r 
(absolutely continuous), e.g. A = ti + pa. 

Denote by p = a q= ee the (densities) Radon—Nikodym derivatives of ,11, 
bg with respect to A Typically we assume that 


O<a<e <p, ae. on X anda<1<b. 
qd 


The quantity 


ytuasna) = faa) (BER) ance (10.1) 


was introduced by I. Csiszar in 1967, see [140], and is called f-divergence of the 
probability measures jz; and [l2. 

By Lemma 1.1 of [140], the integral (10.1) is well-defined and T'y(j11, w2) > 0 
with equality only when 41 = fe. Furthermore by [140], [42] Tp(41, we) does not 
depend on the choice of X. The concept of f-divergence was introduced first in [139] 
as a generalization of Kullback’s “information for discrimination” or I- divergence 
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(generalized entropy) [244], [243], and of Rényi’s “information gain” (I-divergence) 
of order a [306]. In fact the I-divergence of order 1 equals 


Dulogs u(f1; [l2). 


The choice f(u) = (u— 1)? produces again a known measure of difference of distri- 
butions that is called .?-divergence. Of course the total variation distance 


Awe I tp(2) — q(e)| ad(x) 


is equal to T),,-1\(41, 2). Here by assuming f(1) = 0 we can consider I'¢(f11, p12) 
the f-divergence as a measure of the difference between the probability measures 
1, /2- 

This is a chapter where we apply to the discrete case a great variety of interesting 
results of the above continuous case coming from the author’s earlier but recent 
articles [37], [42], [43], [41]. 

Throughout this chapter we use the following. Let again f be a convex func- 
tion from (0,+00) into R which is strictly convex at 1 with f(1) = 0. Let 
X = {x1,%2,...} be a countable arbitrary set and A be the set of all subsets of 
X. Then any probability distribution 4 on (X, A) is uniquely determined by the 
sequence P = {pj1,po,...}, pi = w({a;}), 7 = 1,.... We call M the set of all 
distributions on (X,A) such that all p;, i =1,..., are positive. 

Denote here by p41, U2 € M the distributions with corresponding sequences P = 
{pi,..-}, Q= {q1, G2,---} (pi >0,q >0,7= DD secs) 

We consider here only the set M@* C M of distributions 441, 42 such that 


O<a<ei<bd, i=1,..., on X anda<1<bs, (10.2) 
Gi 
where a, b are fixed. 
In our discrete case we choose \ such that A({z;}) = 1, alli =1,2,.... Clearly 
here j11 < A, fg < ». Therefore by (10.1) we obtain the special case of 


Ty (u1, Ha) = Yas (2) (10.3) 


So (10.3) is the discrete f-divergence or Csiszar distance between two discrete dis- 
tributions from M*, see also [139], [141]. The quantity I’, as in (10.3) is also called 
generalized measure of information. 

The above discrete general concept of I (41, 42) given by (10.3) incorporates a 
large variety of discrete specific [¢ distances such of Kullback—Leibler so called in- 
formation divergence, Hellinger, Rényi a-order entropy, 7, Variational, Triangular 
discrimination, Bhattacharyya, Jeffreys distance. For all these and a whole of very 
interesting general theory please see S. Dragomir’s important book [152], especially 
his particular article [153]. In there S. Dragomir deals with finite sums similar to 
(10.3), while we deal with finite or infinite ones. 
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The problem of finding and estimating the proper distance (or difference or 
discrimination) of two discrete probability distributions is one of the major ones 
in Probability Theory. The above discrete f-divergence measure (10.3) for various 
specific f’s, of finite or infinite sums have been applied a lot to Anthropology, Genet- 
ics, Finance, Economics, Political Science, Biology, Approximation of Probability 
distributions, Signal Processing and Pattern Recognition. 

In the following the proofs of all formulated and presented results are based a lot 
on this Section 10.1, especially on the transfer from the continuous to the discrete 
case. 


10.2 Results 
Part I. Here we apply results from [37]. 
We present 
Theorem 10.1. Let f € C'([a,b]) and tn, 2 € M*. Then 


Tp (41, 2) < Il Fllo0,[a,] (>: \pi — a . (10.4) 


i=l 


Inequality (10.4) is sharp. The optimal function is f*(y) = |y—1|%, a > 1 under 
the condition, either 
i) max(b—1,1—a) <1 and 


C := sup(Q) < +o, (10.5) 


or 
ii) 


Proof. Based on Theorem 1 of [37]. 


Next we give the more general 


Theorem 10.2. Let f € C"*+!((a, b]), n EN, such that f(1) =0, k =0,2,...,n, 
and [1,2 € M*. Then 
FFD IIo, a,b] a 
r n a (rth ’ 10. 
(Hi, M2) < Ga y: Gi” pi — al (10.7) 


Inequality (10.7) is sharp. The optimal function is f(y) := |y—1|"t*, a > 1 under 
the condition (i) or (ii) of Theorem 10.1. 


Proof. Based on Theorem 2 of [37]. 


As a special case we have 


120 PROBABILISTIC INEQUALITIES 


Corollary 10.3 (to Theorem 10.2). Let f € O?([a,b]) and 1, 2 € M*. Then 


T'p(u1, M2) < Ea ENCE bs gq, (Pi — oy) (10.8) 
’ = 5} 4 4 a 


i=1 


[ere 


Inequality (10.8) is sharp. The optimal function is f(y) :=|y—1 , a> 1 under 


the condition (i) or (ii) of Theorem 10.1. 
Proof. By Corollary 1 of [37]. 


Next we connect discrete Csiszar f-divergence to the usual first modulus of 
continuity w. 


Theorem 10.4. Assume that 


0<h:= 5° |pi— ail < min(1 — a,b—- 1). (10.9) 
i=1 
Let pi, fo € M*. Then 
T'(ui, H2) < wil f, A). (10.10) 


Inequality (10.10) is sharp, namely it is attained by f*(a) := |x — 1). 


Proof. By Theorem 3 of [37]. 


Part II. Here we apply results from [42]. But first we need some basic concepts 
from [122], [26]. 

Let vy > 0, n:= [vy] anda=v—n(0<a< 1). Let 2,20 € [a,b] C R such that 
x > Xo, where Xo is fixed. Let f € C([a,b]) and define 


x 


(Ir? f(a) -= raf (x— 8)” f(s)ds, to S2<b, (10.11) 


0 
the generalized Riemann—Liouville integral, where T stands for the gamma function. 
We define the subspace 


Cx ({a, b]) = {f € C"([a,d]): J72,D" f € C'([xo, b))}. (10.12) 


For f € Cz, ({a, 6]) we define the generalized v-fractional derivative of f over |xo, ] 
as 
DY f := DIP. f (v = +). (10.13) 
xo l-a dx 


We present the following. 
Theorem 10.5. Leta<b,1<v <2, fe C¥({a,b]), wi, u2 € M*. Then 


Ds Co,|/a ae —_ 
Py(upa) < atte (>. "(ps aa). (10.14) 


Proof. By Theorem 2 of [42]. 


The counterpart of the previous result follows. 
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Theorem 10.6. Let a <b, v > 2, n:= |v], f € CY¥([a,d]), f%(a) = 0,1 = 
0,1,...,2—1, p41, "2 € M*. Then 


| DEF lloo, fa b] — 1 
<< eo Ca NO : F 
T'(u1, 2) < Tv +1) > qG,” (pi — aq) (10.15) 


Proof. By Theorem 3 of [42]. 


Next we give an Lg estimate. 


Theorem 10.7. Let a <b, v > 1, n:= [v], f € C¥({a,b]), fO(a) = 0, i = 
0,1,...,27-1, wi, u2 € M*. Leta, 6B >1: t+ gal. Then 
|DY fll, a,b] oS Q-v—4 -i4i 
T ——— — — ; ® (p; —aqi)” ‘t2 |. 10.16 
st) S Tyg — 1) 4 DvP a4 (pi — agi) (10.16) 


Proof. By Theorem 4 of [42]. 


It follows an L., estimate. 


Theorem 10.8. Let a <b, v > 1, n := [v], f € C¥({a,b]), fP(a) = 0, i = 
0,1,...,2-—1, wi, 2 € M*. Then 


|p fl : “a ee 
Dy(q1, 2) <4 aac aie . ya (pi — agi)” ). (10.17) 


Proof. Based on Theorem 5 of [42]. 


We continue with 


Theorem 10.9. Let f € C'({a,b]), a#b, wi, 2 € M*. Then 
i b 
ere nee fla ye Soy (# :) (a pi) (10.18) 


Proof. Based on Theorem 6 of [42]. 


Furthermore we obtain 


Theorem 10.10. Let n odd and f € C"*+({a,b]), such that f+) > 0 (< 0), 
L1, ba © M*. Then 


Tp(t1, M2) < (& 


ome - Feral EEG) 


- (bq; — D3)" (aq; -»}')). (10.19) 


Proof. Based on Theorem 7 of [42]. 


Part III. Here we apply results from [43]. We begin with 
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Theorem 10.11. Let f,g € C'({a,b]) where f as in this chapter, g' #0 over [a, bj, 


L1, 2 € M*. Then 
u(#) - 7) ‘ (10.20) 


(Xe 


f' 
T'r(t1, #2) < | : 
g oo, [a,b] 


Proof. Based on Theorem 1 of [43]. 


Examples 10.12 (to Theorem 10.11). 
1) Let g(x) = 4. Then 


< qi 
P(u15 412) < lle? f'lloo,f.0 (>. Hip, - a ) (10.21) 
i=1 ©? 


2) Let g(x) = e”. Then 


T pl pe, tta)S lle? F lle, fa, 8] ( di 
i=l 


ae 4) (10.22) 
3) Let g(x) = e~*. Then 
D p(y41, 2) < lle*F' loo, (>: qi 
i=1 


4) Let g(x) = Ina. Then 


et — -) (10.23) 


in( =) ) (10.24) 


DT p(H1, M2) < IF’ Iloo,fa,0) ps di 
t=1 


5) Let g(x) = xInz, with a > e~!. Then 


f" oe (2)}) 
Petts fio) = ifInf—)\}). 10.25 
p (1, H2) ee ne (d» 7 (10.25) 
6) Let g(x) = /z. Then 
Py (tr, M2) < 2||V2Fl|00,[a,0) (> van vai) (10.26) 
4=1 


7) Let g(x) = 2%, a>1. Then 
f' 


ga-l 


T5(t1, 2) < 


a 3 : 
11 


(>: gq; “|p? - 7) ; (10.27) 
co, [a,b] 


Next we present 


Theorem 10.13. pie bj), aA D, p1, 2 € M*. Then 


c Fillo, [a,b] a 24 5 
2 


5 (1, 2) — Ss ~(b—a) 
— (a+b) + Soar (10.28) 
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Proof. Based on Theorem 2 of [43]. 
It follows 


Theorem 10.14. Let f € C®?)({a,b]), aA b, ua, 2 € M*. Then 


rane f ome (Wat) (54 


atb))? 


boryle ee Ps (2 at 
< fo Utnen | o0[ oe 7 “af ee 


Proof. Based on (33) of [43]. 


Working more generally we have 


Theorem 10.15. Let f € C({a,b]) and g € C([a,b]) of bounded variation, g(a) 4 
g(b), 1, Ho € M*. Then 


b 
Py (Hi, M2) — OErOne f(x)dg(x) 


oo b 
< les a, if P 
[a a1 | 


where, 
g(t) — g(a 
g(d)— gay’ “6S * 
P(g(z), g(t) = Pet sas (10.31) 
g(t) — 9 
abate)’ z<t<b 


Proof. By Theorem 5 of [43]. 


Example 10.16 (to Theorem 10.15). Let f € C'({a, |), a 4 b, and g(x) = e”, Ina, 
Jz, u*,a>1; 2 >0. Then by (71)-(74) of [43] we derive 
1) 


T (1, 2) — sta fs f(a )- 


2 - Whee — ~e(2—a) + (b— at. (10.32) 


1=1 


Ty (m1, 2) — (n(=) ) ie [@e 


< Mf" loon ( (m(2)) ihe n(ab) + (a+b at. (10.33) 


124 PROBABILISTIC INEQUALITIES 


3) 
T's (p1, fz) — ec 
4) (a >1) 
T's(u1, 2) — pe maw [| 1 we" de 
Wasa {2 2 5 (Dae) 
fern | (10.35) 


We continue with 


Theorem 10.17. Let f € C®)({a,b]), g € C({a,b]) and of bounded variation, 
g(a) Z g(b), fa, wo € M*. Then 


Tp (1, fa) — COEFOn aa +f 1) )dg(a 


caorcay (fem “BLL -62) 4) 
(>. 


< [flee Geel P (o().0¢e })|/Ptt. ae plane) J 
- (10.36) 
Proof. Apply (79) of [43]. 
Remark 10.18. Next we define 
PY(2,8) = a pe (10.37) 


Let f € C1({a, b]), a 4 b, 1, 2 € M*. Then by (25) of [43] we get the representation 


b 
«(ft s), (10.38) 
where by (26) of [43] we have 


Ry i= ya (/ pe (E.2) ror) : (10.39) 


Tp (141, 2) > b 
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Let again f € C1({a,]), g € C({a,b]) and of bounded variation, g(a) 4 g(b), 
H1,H2 © M*. 
Then by (47) of [43] we get the representation formula 


b 
T ¢(t1, 2) = os | f(a)dg(x) + Ra, (10.40) 


where by (48) of [43] we have 


(ee = di iL P (o().9(@)) ror) (10.41) 


Let G,8>1: + Z=1, then by (89) of [43] we have 


Ral < IlF'lla.to. (>: a p(2, ) lew) (10.42) 


and by (90) of [43] we obtain 
(2, ) | (10.43) 
G oo, [a,b] 


[Ral < WF" lltfa.e) (soalP 
6) Ia) 


Furthermore by (95) and (96) of [43] we derive 
Pi 
is (o(2).9¢9) | (10.44) 
G co, [a,b] 


[Ral < IIf'lla,fa,e) (>. di 
= 


and 
[Ral < |’ lla,fa,6) 6 qi 
j=1 


Remark 10.19. Let a < b. 
(i) By (105) of [43] we get 


Ral < IF" Ila,fa,6) (Seam (# —a,b— 2) , f €C'({a, b]). (10.45) 


i=l 
(ii) Let g be strictly increasing and continuous over [a,b], e.g. g(x) = e”, Ina, 
Jax, x with a > 1, and z > 0 whenever is needed. Also let f € C1({a,b]) and as in 
this chapter always. Then by (106) of [43] we find 


[Ral < cE a 5 (Soma ( (# ) — g(a), wo)—a(2))). (10.46) 


In particular via (107)-(110) of [43] we obtain 
1) 


|Ra| < a RCRD 1 (Soa max (e* — oe) : (10.47) 


1=1 


126 PROBABILISTIC INEQUALITIES 


2) 


fla tao. (< Pi Pi 
[Ra] < aa ae (doe. max (m(#) ~ine,ind-In(2))) : (10.48) 
3) 
|Ra| < i WAT a1 ed (Se, , max (/2- Ja, Vb — 2)) : (10.49) 


and finally for a > 1 we derive 
4) 


\Ra | < i pee a (doe max (He a) : (10.50) 
qd qd 


i=l a a 


iii) At last let @, > 1 such that = +3 =1, then by (115) of [43] we have 
> {af (pi — ag) O+1 + (bas — ps)P+} 
Ril < |If'lleta. p/ ——_—_- "ns 10.51 
[Ral < IF la.ta.0) | (3 + lai ( ) 


Part IV. Here we apply results from [41]. 
We start with 


Theorem 10.20. Let0 <a<1<b, f as in this chapter and f € C"([a,6]), n> 1 
with | f(t) — f° (1)| be a convex function in t. Let 0 < h < min(1—a,b—1) be 
fixed, pi, M2 € M*. Then 


p(H15 2) <S EO MIS gi Di — Gi) 
k=2 i=1 


wit _ n+1 
teen ae A lpi — ail ‘ (10.52) 


Here w, is the usual first modulus - ae If f)(1) =0, k=2,...,n, then 


—n n+1 
Tp(1, H2) < a ee: we (S qi" |pi — ail ): (10.53) 
Inequalities (10.52) and a when n is even are attained by 
f(t):= a <t<b (10.54) 
ate a<t<b. 


Proof. By Theorem 1 of [41]. 


We continue with the general 


Theorem 10.21. Let f € C([a,b]), n > 1, b1,u2 € M*. Assume that 
wi(f™, 6) <w, where0<6<b-—a,w>0. Let x € R and denote by 


bn(a) := ie H att (10.55) 
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where [-] is the ceiling of the number. Then 
ie vo es Di — Gi 
p(H1, M2) 2y! JS gi ‘(pi — qi)*| + w H +09 ( Soa a. ‘)) . (10.56) 
k=2 


i=1 
Inequality (10.56) is sharp, namely it is attained by the function 
fr(t) = won(t-1), a<t<b, (10.57) 


when n is even. 


Proof. By Theorem 2 of [41]. 
It follows 


Corollary 10.22 (to Theorem 10.21). Let ji1, 2 € M*. It holds (0 <6 < b—a) 


T'y(ta, 42) < >> Hi ao (> q; “(pi - o)') 
: i=l 
n 1 - —n n 
wi (f' 94 (So lpi — Gi 2) 
+ = - qe" |pi — at 


at (n— Soy wal qi Vie aS | \ (10.58) 


| 
T 


and 


Ts (t1, fa) < 
k= 


+ wi(f™, 6) mo ee "lpi — Gl ; 
" CES (0 me} (10.59) 


pi-u)* 


In particular we have 


and 


p(t, M2) < wi(f’, 6) 4 (% \pi — see Di — i) ri (10.61) 


Proof. Based on Corollary 1 of [41]. 


Also we have 
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Corollary 10.23 (to Theorem 10.21). Let p14, 2 € M*. Assume here that f‘") is 
of Lipschitz type of order a, O<a<1, ie. 

wi(f™, 5) << K6%, K>0, (10.62) 
for any0O <6 <b-—a. Then 


oe) 
PO awa) 
w=1 


Co 
(Serna) 


=2 Th (a+it) \i=1 
“ (10.63) 
When n is even (10.63) is attained by f*(x) = cla —1|"t%, where c:= K/(Tl(a+ 
i)) >0. -_ 
Proof. Based on Corollary 2 of [41]. 
Corollary 10.24 (to Theorem 10.21). Let p14, u2 € M*. Suppose that 
b02 Yeap pl (10.64) 


Then 
Py (Hi, M2) Sw ( (Soar := ‘)) {domain gh (10.65) 


Proof. Based on Corollary 3 of [41]. 


Corollary 10.25 (to Theorem 10.21). Let 1, u2 € M*. Suppose that 


S> lpi — | > 0. (10.66) 


Ty(masH2) S #1 ( is lpi — ail )) {(3 psa) ad 
< (6 a)w (r. (>: [pi — al) (10.67) 


Proof. Based on Corollary 5 of [41]. 
We give 


Then 


Proposition 10.26. Let f € C([a,b]), ui, u2 € M*. 
i) Assume that 


S> lpi — | > 0. (10.68) 


Optimal Estimation of Discrete Csiszar f-Divergence 129 


Then 


ip (pr, M2) )< 2W (: (> pi 7G )) : (10.69) 


ii) Let r > 0 and 


b-a2r (>. ba) > 0. (10.70) 
i=1 


Ty(1, ue) < (1 am ) 1 (> pe ai)) . (10.71) 


Proof. Based on Proposition 1 of [41]. 


Then 


Also we give 


Proposition 10.27. Let f as in this setting and f is a Lipschitz function of order 
a,0O<a<1l, ie. there exists K >0 such that 


If(@)- FM < Kla—yl*, all x,y € [a, bl. (10.72) 
Let pt, feo € M*. Then 


(1,2) < (>: q, *|pi — al) (10.73) 


i=l 


Proof. Based on Proposition 2 of [41]. 


Next we present some alternative type of results. 
We start with 


Theorem 10.28. All elements involved here as in this chapter and f € C"*"([a, dj), 
nEN. Let py, 2 € M*. Then 


an (sree Sa Mee ay!) (10.74) 


(Se ce (t—1)" jmoneyat)) (10.75) 


Proof. By Theorem 12 of [41]. 


Next we estimate 72. We give 


f(t, [l2) 


where 
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Theorem 10.29. All here as in Theorem 10.28. Then 


By: Il f° ve lhesst a,b] 3 md = gaan 
1: (n+ 1)! Gq, |Pi- 


WFO Ua tat) [SO an n 
By = =} De: Ipi— ail” ], 


i=l 


|wo| < min of (10.76) 


1 1 
and fora,@>1:=+=+—-=1, 
a £p 


WFO? Yaa b) [wa ien-4 aya 
3 = Nao + Ile 2 G  *Ipi— al"* J. 


Also it holds 


n—-1 ee) 
f (Ha, H2) es fee) (E)arhn — qi)*t"| + min(B1, Bo, Bs). 
| i=1 , 
(10.77) 
Proof. By Theorem 13 of [41]. 
Corollary 10.30 (to Theorem 10.29). Case of n = 1. It holds 
LF lheefevt ~ a7 2 
Biii= a dM (pi- 4)” ], 
Bar = ||P" It [a,0) (>: [pi — ol) 
lwb2| < min of oe (10.78) 
1 1 
and for a,8>1:+=+-=1, 
a 
IF llatao) ( -4 144 
B31. := ——— . i qi! F |. 
3,1 (8+1)1/8 4 |p | 
Also it holds 
¢ (L1, p2) we Vip i i — Wi) + min(B,1, B21, B31). (10.79) 


Proof. By Corollary 8 of [41]. 


We further give 
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Theorem 10.31. Let all elements involved as in this chapter, f € C"*"([a, dj), 
n EN and py, 2 € M*. Then 


f(a; [2) TS Fa mee (E)ar ar * (pi — oy) 


+ aaa ole re) d (Samer ay) fe (10.80) 
where 
Lgl Bi 
pa i= » gis (2) (10.81) 
with 
va(e) = P(e ayr(rr*(a) = FM Aa)yas. (10.82) 


Proof. Based on Theorem 14 of [41]. 


Next we present estimations of w4. 


Theorem 10.32. Let all elements involved here as in this chapter and f € 
On} ([a, bl), n EN, p1, fe € M* and 


aay (da 1s — r=?) Sha (10.83) 


Then (10.80) is again valid and 
Idal < Tw es : qo" "Pi 4 rs?) 


+ 2) 


Proof. Based on Theorem 15 of [41]. 


Also we have 


Theorem 10.33. All elements involved here as in this chapter and f € C"**({a, b]), 
n EN, 1, M2 € M*. Furthermore we assume that |f+(t) — f*)(1)| is conver 
int and 


1 co 
OS ae ay (>. @. 9 pes a2] < min(1 —a,6— 1). (10.85) 


mi(n + 2) \44 
Then (10.80) is again valid and 


|a| < wr (so), HEE (>. Ge” "pi a?) (10.86) 


132 PROBABILISTIC INEQUALITIES 


The last (10.86) is attained by 
(s—1)"*? 


f(s) = “+a? acscb, (10.87) 


when n is even. 


Proof. By Theorem 16 of [41]. 


When n = 1 we obtain 


Corollary 10.34 (to Theorem 10.32). Let f € C?([a,b]), wi,u2 € M* and 


1 = 3 
Be (>: gq; |pi — ail <b =a. (10.88) 


Then 
f (Ha; #2) =D Vio — 4) — rm (Seat +41, (10.89) 
where 
_< Di 
Wa. i= Dao (2) : (10.90) 
with 
vale) =— f @-E"O-FO)d, ce lad. (10.91) 


Furthermore it holds 


baa| < wr ( ne 5( qi |pi -a)) (> Gp ‘ + ‘) (10.92) 


Proof. Based on Corollary 9 of [41]. 


Also we have 


Corollary 10.35 (to Theorem 10.33). Let f € C?([a,b]) and |f"(t) — f’"(1)| és 
conver int. Here p14, 2 © M* such that 


(ea 5 (Soa pi — | ’ < min(1 — a,b 1). (10.93) 


Then again (10.89) is valid and 


24,1 can (rr 5 (Sat Di — Gil )) (10.94) 


Proof. By Corollary 10 of [41]. 


The last main result of the chapter follows. 
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Theorem 10.36. All elements involved here in this chapter and f € C"*'({a, b]), 
neEN, p11, po € M*. Additionally we assume that 


JF? (a) — Ff) < Ke — gl, (10.95) 
K>0,0<a<l,allaye 7 with (x —1)(y—1)>0. Then 


V5 (41, 2) = aa oo T(t k+1) (2) (pi - ait | 


-1 
: oni fern o (dare gy "(Di - ay] +4. (10.96) 
Here it holds 
[al S nn oe 1) as > qe” (pi - ane (10.97) 
Inequality (10.97) is attained when n is even by 
f*(a) =ea—1pre*", we [a,)] (10.98) 
where 
K 
C= a (10.99) 


[I(nta+1—J) 
j=0 


Proof. By Theorem 17 of [41]. 


Next we give 


Corollary 10.37 (to Theorem 10.36). Let f € C?({a, b]), 11, u2 € M*. Additionally 
we suppose that 


rwo=2'*@)| <Kle—al* (10.100) 
K>0,0<a<l, all x,y € [a,b] with (cx —1)\(y—1) > 0. Then 


f(H1, M2) =>s (Z)e 2G) = ru (Soar -0)") +Wa1- (10.101) 


i=1 


Here 


Yar = 2 Gis, (2) (10.102) 


t=1 
with 


W3,1(2) = — fe — 1)(f"(t) — f’))dt, x € [a,b]. (10.103) 
It holds that 


lwail < ( 


j (> Ge pi - ai) (10.104) 
w=1 


Proof. By Corollary 11 of [41]. 
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Chapter 11 


About a General Discrete Measure of 
Dependence 


In this chapter the discrete Csiszar’s f-Divergence or discrete Csiszar’s Discrimina- 
tion, the most general distance among discrete probability measures, is established 
as the most general measure of Dependence between two discrete random variables 
which is a very important matter of stochastics. Many discrete estimates of it are 
given, leading to optimal or nearly optimal discrete probabilistic inequalities. That 
is we are comparing and connecting this general discrete measure of dependence to 
known other specific discrete entities by involving basic parameters of our setting. 
This treatment is based on [38]. 


11.1 Background 


The basic general background here has as follows. 

Let f by a convex function from (0,-+00) into R which is strictly convex at 1 
with f(1) =0. 

Let (Q, A, A) be a measure space, where \ is a finite or a o-finite measure on 
(Q, A). Let ji, “2 be two probability measures on (Q, A) such that 1 << A, wo <r 
(absolutely continuous), e.g. A = ti + pa. 

Denote by p = ta q= die the (densities) Radon—Nikodym derivatives of ,11, 
ba with respect to A. Typically we assume that 


j2ge 25. ae. on Qanda<1<b. 
qd 


The quantity 
p(x) 
T (pr, = [or (2) aw, 11.1 
f(t, H2) 5c) ala} (x) (11.1) 
was introduced by I. Csiszar in 1967, see [140], and is called f-divergence of the 


probability measures jz; and [l2. 


By Lemma 1.1 of [140], the integral (11.1) is well-defined and T'y(11, 2) > 0 with 
equality only when #4; = fe. Furthermore by [140], [42] T'-(441, w2) does not depend 
on the choice of ». The concept of f-divergence was introduced first in [139] as a 
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generalization of Kullback’s “information for discrimination” or I-divergence (gen- 
eralized entropy) [244], [243], and of Rényi’s “information gain” (I-divergence) of 
order a [306]. In fact the I-divergence of order 1 equals 


Du tog, u(f1, [l2). 


The choice f(u) = (u—1)? produces again a known measure of difference of distri- 
butions that is called .?-divergence. Of course the total variation distance 


tase i bp(2) — q(2)| aX(a) 


is equal to T),,1\(41, #2). Here by assuming f(1) = 0 we can consider I¢(f11, p12) 
the f-divergence as a measure of the difference between the probability measures 
1, 2. 

This is an expository, survey and applications chapter where we apply to the 
discrete case a large variety of interesting results of the above continuous case com- 
ing from the author’s papers [37], [42], [43], [41], [39]. Namely here we present a 
complete study of discrete Csiszar’s f-divergence as a measure of dependence of two 
discrete random variables. 

Throughout this chapter we use the following. 

Let again f be a convex function from (0,-+00) into R which is strictly convex 
at 1 with f(1) =0. 

Let the probability space (Q, P) and X, Y be discrete random variables from 2 
into R such that their ranges Rx = {x1,%2,...}, Ry = {y1, y2,.--}, respectively, 
are finite or countable subsets of R. 

Let the probability functions of X and Y be 


pi = p(a;) = P(X =2;), 1=1,2,..., 


respectively. Without loss of generality we assume that all p;,q; > 0, 7,7 =1,2,.... 
Consider the probability distributions uxy and px xX Wy on Rx xX Ry, where 
Lexy, x, fy stand for the joint probability distribution of X and Y and their 
marginal distributions, respectively, while j1x x py is the product probability dis- 
tribution of X and Y. Here xy is uniquely determined by the double sequence 


tig = U(ai, yj) = P(X = 2i,Y = y;) =exy({ (ti, ys)}), 4 7=1,2,..., (11.3) 


which t is the probability function of (X,Y). 

Again without loss of generality we assume that tj; > 0 for all 7,7 = 1,2,.... 
Clearly also here ux, fry are uniquely determined by p; = wx ({zi}), 7 = wy ({y;}), 
respectively, 7,7 = 1,2,.... It is obvious that the product probability distribution 
[Lx X fy has associated probability function pq. 

In this article we assume that 


ti; oy 
O<a<—<b, alli,j=1,2,... anda<1<b, (11.4) 
PiDj 
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where a, b are fixed. 

Let P be the set of all subsets of Rx x Ry. We define A to be a finite or o-finite 
measure on (Rx xX Ry,P) such that A({(xi, y:)}) = 1, for all 7,7 = 1,2,.... We 
notice that uxy and wx Xx py are discrete probability measures on (Rx x Ry, P) 
and are absolutely continuous (<) with respect to A. 

By applying (11.1) we obtain the special case 


Ty(uxy, ux x by) =S°S° pias f (==) (11.5) 
i=1 j=l Pid 
This is the discrete Csiszar’s distance or f-divergence between xy and [x X My. 
The random variables X,Y are the less dependent the closer the distributions uxy 
and fsx X py are, thus ['¢(fxy, ux X fey) can be considered as a measure of de- 
pendence of X and Y, see also [305], [140], [39]. 

From the above derives that Ts(uxy,ux x wy) > 0 with equality only when 
xy = x X py, ie. when X,Y are independent discrete random variables. As 
related references see here [139], [140], [141], [152], [244], [243], [305], [306], [37], 
[42], [43], [41], [39]. 

For f(u) = ulog, u we obtain the mutual information of X and Y, 


(X,Y) = I(uxy || ux x by) =Tutog, u(uxy, ux X by), 
see [140]. For f(u) = (u— 1)? we get the mean square contingency: 
(X,Y) =Ta-12(uxy, ex X by), 


see [140]. 

In Information Theory and Statistics many various divergences are used which 
are special cases of the above I’, divergence. I's has many applications also in 
Anthropology, Genetics, Finance, Economics, Political Science, Biology, Approxi- 
mation of Probability Distributions, Signal Processing and Pattern Recognition. 

In the next the proofs of all formulated and presented results are based a lot on 
this Section 11.1 especially on the transfer from the continuous to the discrete case. 


11.2 Results 


Part I. Here we apply results from [37], [39]. We present the following 
Theorem 11.1. Let f € C'({a,b]). Then 


Dy(uxy, ux x by) < |If"lloo,fas) | >> SS Its — pial | - (11.6) 


i=1 j=1 


Inequality (11.6) is sharp. The optimal function is f*(y) = |y—1|*, a > 1 under 
the condition, either 


138 PROBABILISTIC INEQUALITIES 


i) 
max(b—1,1—a)<1 and C:= sup (piq;) < +00, (11.7) 
or a 
ii) 
Iti — Digg] <pigg <1, alli,7 =1,2,.... (11.8) 


Proof. Based on Theorem 1 of [37] and Theorem 1 of [839]. 


Next we give the more general 


Theorem 11.2. Let f € C*1((a,b]), n € N, such that f“(1) = 0, k = 
0,2,3,...,n. Then 


If jem ‘eg: [a,b] n 
Ts(uxy, bx X by) < ai Sa qj” tig — Pidy| elie, “VELO) 
t=1 g=1 


Inequality (11.9) is sharp. The optimal function is f(y) := \y—1|"t%, a> 1 under 
the condition (i) or (tt) of Theorem 11.1. 


Proof. As in Theorems 2 of [37], [39], respectively. 


As a special case we have 


Corollary 11.3 (to Theorem 11.2). Let f € C?({a,b]). Then 


WFP Mo [a,b] a [a,b] 
Ty(uxy, bx x py) < ee q; (ty —pigy)? |. (11.10) 


w=1 j=1 


Inequality (11.10) is sharp. The optimal function is f(y) := |y—1|+%, a > 1 under 
the condition (i) or (tt) of Theorem 11.1. 


Proof. By Corollaries 1 of [37] and [839]. 


Next we connect Csiszar’s f-divergence to the usual first modulus of continuity 


Wy. 


Theorem 11.4. Suppose that 


0<hi= ene < min(1 —a,b-1). (11.11) 
i=1 j=l 
Then 
Te(uxy, ex X py) Swi(f,h). (11.12) 
Inequality (11.12) is sharp, t.e. attained by 
f*(@) =|e- 1) 


Proof. By Theorems 3 of [37] and [39]. 
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Part II. Here we apply results from [42] and [39]. But first we need to give some 
basic background from [122], [26]. Let v > 0, n := [vy] anda =v—n(0<a< 1). 
Let x,2o € [a,b] C R such that x > wo, where %o is fixed and f € C([a,b]) and 
define 


x 


(J7° f(z) = ra (c—s)’'f(s)ds, a <a<b, (11.13) 


0 
the generalized Riemann-Liouville integral, where T stands for the gamma function. 
We define the subspace 


Cz ([a, 8) = {f € C"([a, 8) : IZ f € C?([x0, B))}- (11.14) 


For f € Cz, ({a,6]) we define the generalized v-fractional derivative of f over [xo, b| 
as 
d 


Depa Diet (v = =) ; (11.15) 


We present the following 
Theorem 11.5. Leta <b, 1<v <2, f © C¥([a,b]). Then 


LOS Phase fw wy : 
TeEeD | >. (Pi;)" (tig —apiqy)” |. (11.16) 


ij=l 


Ty(uxy, x X py) < 


Proof. It follows from Theorem 2 of [42] and Theorem 4 of [39]. 
The counterpart of the previous result comes next. 


Theorem 11.6. Let a < b, v > 2, n:= [|v], f € C¥({a,b]), fO(a) = 0, 7 = 
0,1,...,2-—1. Then 


Eien eee ; 
er S- (pigg)' "(tig — api)” |. (1.17) 


ij=l 


Ty(exy, ux X py) < 


Proof. By Theorem 3 of [42] and Theorem 5 of [39]. 


Next we give an Lg estimate. 


=1. Then 


CO 

—p—4t = 1 
S > (pig)? °F (tig — apiqg)”F 8 
i,j=l 


DoF lla.ta,b) 
Dy(uxy, ux Xpy) < TO\(6e - 1) +)V8 


(11.18) 


Proof. Based on Theorem 4 of [42] and Theorem 6 of [39]. 


It follows an L., estimate. 
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Theorem 11.8. Leta <b,v >1, 7 := |v], f € CY¥([a,d]), f%(a@) = 0,1 = 
0,1,...,2-—1. Then 


DY Flla tae [<= oa aes 
Py(uxy, ux x wy) < AE > (pia)? (tig — apiqy)”1 |. (11.19) 


ij=l 


Proof. Based on Theorem 5 of [42] and Theorem 7 of [39]. 


We continue with 


Theorem 11.9. Let f € C'([a,b]), a<b. Then 


45 b 
Ty(uxy, ux X py) <i fn fi(zjde— 7 (=) (Bae -4:)) 


tj=l 


(11.20) 


Proof. Based on Theorem 6 of [42] and Theorem 8 of [39]. 


Furthermore we get 


Theorem 11.10. Let n odd and f € C"t*({a,b]), a < b, such that ft) > 0 
(<0). Then 


Ty(uxy, ex X by) < 22 aes af sou 


n 


ae 


 (peqg)'~* (bpedy — tes)" (apegy — +) . (11.21) 


(yer (2) 


k=0 \@=1 j=l Ped; 


Proof. Based on Theorem 7 of [42] and Theorem 9 of [39]. 


Part III. Here we apply results from [43] and [39]. 
We start with 


Theorem 11.11. Let f,g € C'({a,b]) where f as in this chapter, g' 4 0 over [a,b]. 
Then 


fi oS ti; 
Ty(uxy, bx X by) < 5 by pigg |g —+ } — 91) J - (11.22) 
co, [a,b] ij=l Pid; 


Proof. Based on Theorem 1 of [43] and Theorem 10 of [39]. 
We give 


Examples 11.12 (to Theorem 11.11). 
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1) Let g(z) = 4. Then 


Ts(uxy, ux X py) < [|e f"lloo,[a,0] 3 7 be Sie | | (11.23) 
ij=l 
2) Let g(a) = e*. Then 
Ty(uxy, ux X py) < lle7* f'lleo,fa,0) (s pigs let /Pi% —el]. (11.24) 
i,j=l 
3) Let g(a) = e~*. Then 
Ds(uxy, ex x py) S lle" fll oofa,t) 3 pia ex ele er"! |e (11.25) 
ij=l 


4) Let g(x) = Ina. Then 


in( bis )| , (11.26) 
PiDj 


co 
T'y(uxy, ux X My) < |l2F"lloo,ta.t) | D> pid 
i,j=l 


5) Let g(x) = xInz, with a > e~!. Then 


Co 


de ti 


oo,[a,b] \ 4 j=1 


/ 


in( tis ) , (11.27) 
Pidj 


T x < |} ——_— 
f(MXYs Lx X py) < Ife 


6) Let g(x) = /z. Then 


Tp(uxy, ux X py) < 2I|Vef"lloo,fas) | >) VPiG| Vt — VPiG|}- (11.28) 


ij=l 
7) Let g(x) = 2%, a>1. Then 
1 ie ~ l-a|,a a 
Ts(uxy, ex X by) < alee S- (pigs) “|t8 — (piggy )*1 | - (11.29) 
co,[a,b] \i,j=1 


Next we give 


Theorem 11.13. Let f € C'({a,b]), a< b. Then 


1 b 
Cie ie) = i f(a)de 


b-—a 
IF llesstese a? + b? oo 7 
5 bai ar ese a SS ig) || GL80) 
ij=l 


Proof. By Theorem 2 of [43] and Theorem 11 of [39]. 
It follows 
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Theorem 11.14. Let f ¢ C®)({a,b]), a <b. Then 
: b) — a+b 
Ty(uxy, Mx X py) — —/ f(s)ds — (a) (1 = at) 


= =) 8 


~ (b= 4) 
~ 2 


le lesctacay — Pidj 


4,g=1 


(b— a)? 4 12 


(11.31) 


Proof. Based on (33) of [43] and Theorem 12 of [39]. 


Working more generally we have 


Theorem 11.15. Let f € C*({a,b]) and g € C({a,b]) of bounded variation, g(a) 4 
g(b). Then 


b 
Dy(uxys ux x by) — FOE OnE f(s)dg(s) 


oe b 
< IF" Iloo,fa,b) 1 Pid; (/ 


t,j=l 

where 
g(s — (a) Wee 
g(b) — g(a) 

P(g(z),.9(8)) = (11.33) 

g{s) = (0) Bes < b. 
g(b) — g(a) 

Proof. By Theorem 5 of [43] and Theorem 13 of [39]. 

We give 


Examples 11.16 (to Theorem 11.15). Let f € C({a,0]) and g(x) = e”, Ina, /2, 
x*,a>1;2>0. Then by (71)-(74) of [43] and (30)—(33) of [39] we obtain 
1) 


Tp(uxy, bx X py) ata fs f(s)e*ds 


< oe 3 S piggett/?% — (2 —a) +e°(b— a}. (11.34) 


4,g=1 


2) 


Ty(uxy, ex X by) — (n()) ff MS) 


» tig in( 72 


< Wf"lojan (0(2)) {: (5 : -)) min soss-ayh 


(11.35) 
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3) 


b 
Ty(uxy, ex X by) — woe fas 


g Wet aia f (s ny ee a a} 


4,9=1 
(11.36) 
4) (a>1) 
Tp(uxy, ux X py) iw | 10 f(s)s°*ds 
lat llsecta Bi 2 = : a4.a 
= (b* — a®) _ (a+ 1) fin a] 
+4 (35) ore +) or 4h (11.37) 


We continue with 


Theorem 11.17. Let f € C)({a,b]), g € C([a,b]) and of bounded variation, 
g(a) # 9(6). Then 


T's(uxy, ux X by) 


aoraaen f £6 maa =a aca (fone) 
{Sle( Fos ny 
< [FP llee,fa, 1d olf P(g ()..« »)) [Pats ats) 


(11.38) 
Proof. Apply (79) of [43] and Theorem 14 of [39]. 


Remark 11.18. Next we define 
x s—a, sé€a,z], 
P = 11. 
2) {: —b, s€(z,). aro) 
Let f € C1({a, b]). Then by (25) of [43] we get the representation 


b 
Dy(uxy, wx X py) = — (/ Heys +R,) 
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where according to (26) of [43] and (36) of [39] we have 


. 3 ia (s (+s) rt) (11.40) 


4,g=1 
Let again f € C'({a,b]), g € C({a,b]) and of bounded variation, g(a) 4 g(b). Then 
by (47) of [43] we get the representation 
1 b 
Pyluxysnx x wy) =e | fl s)dgls) + Ra, (11.41) 
, (9() — 94) Ja 
where by (48) of [43] and (38) of [39] we obtain 


ae > Didi (i (o( =). a(s)) rs) . (11.42) 


t,j=1 


Let 4,8 >1: 44+ a= = 1, then by (89) of [43] and (39) of [39] we have 


oo : ie 
[Fea LF Natok S- DPidy P (=*..) | : (11.43) 
B,[a,b] 


Jj 
UAd oo,[a,b 


Furthermore by (95), (96) of [43] and (41), (42) of [39] we find 


Ral <f'lawai { D> poll? (o().009) haw) / (1145) 


and by (90) of [43] and (40) of [39] we get 


[Ral < IF" lla,fa,6) (x Didy 


ij=l 


(11.44) 


i,j=1 


and 


Ral < Wf Ila,fa,0) S- PiGy 


tjal 


Remark 11.19. Let a < bD. 
(i) Here by (105) of [43] and (43) of [39] we obtain 


be 
Ral <U'llan (So poapmax (AL —a,b- A) ) fe oa). (1.47 
ea, Pid Pid 
(ii) Let g be strictly increasing and continuous over [a,b], e.g., g(x) = e”, Ina, 
Jz, x* with a > 1, and x > 0 whenever is needed. Also f € C1({a,b]). Then by 
(106) of [43] and (44) of [39] we find 


ais — sta (s Did; max (4 (s+) — g(a), 9(0) -o(#4))] 


t,j=1 
(11.48) 
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In particular via (107)—(110) of [43] and (45)—(48) of [39] we obtain 
1) 


[Ral < —— Ih (zen S > pigs max (es /P!% — @%, e? — efi /Pias) | , (11.49) 
€ 
i,j=1 
2) 
Fh [0,6] ( (+2 -) ( tij )) 
Ral << max [ In —Ina,Inb— Inj —+ 
[Ral S (Inb — Ina) xe a Pidy Didi 
(11.50) 
3) 


IF’ Ih1,f0,0) - ti; bij 
(ah eee pigd mide | pS 4/6, Vb) bs 11.51 
[Ral Vvb— fa pe , Pig Pid; ( ) 


and finally for a > 1 we obtain 


4) 
ir Il1,[0,2) | = ( ti ty 
Ra < el piq; max { —%— — 97, p79“ _)]. ase 
[Ra| — ae ey : (pig) (piqy)* ( ) 
At last 


iii) Let &,@ > 1 such that 4 + 3 = 1, then by (115) of [43] and (49) of [39] we 
have 


[Ral < [If 'lla,fa,0) eS ( 


ij=l 


(tag — apiag P+ (bpiag — tig) PP (11.53) 
(8+ 1)pig; : . 


Part IV. Here we apply results from [41] and [39]. 
We start with 


Theorem 11.20. Let a < b, f € C"([a,b]), n > 1 with |f™(s) — f(™(A)| be a 
convex function in s. Let0<h < min(1—a,b—1) be fixed. Then 


Ts(uxy, x X by) Zag eID >> (Digg) * (tig — piggy)” 
a2 


wi(f™, h) oe ee eres 
oF (n+ 1)!h S~ (pias) |tij Dids| i (11.54) 


i,j=l 
Here w, is the usual first modulus of continuity. If f® (1) =0,k=2,...,n, then 


W ae h See) Sie r 
ae S (pias) " [tag — pegy|"** | - (11.55) 


ij=l 


Ts(uxy, px X by) < 
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Inequalities (11.54) and (11.55) when n is even are attained by 


ie Js —1|"*1 


Proof. Apply Theorem 1 of [41] and Theorem 15 of [39]. 


We continue with the general 


Theorem 11.21. Let f € C"([a,b]), n > 1. Assume that wi(f\”,6) < w, where 
0<d<b-—a,w>0. Let x € R and denote by 


ll 97 (lal — 9) 
= z | Se ae 11.57 
Pn(z) | 5 aoe et) 
where [-] is the ceiling of the number. Then 
- zal = 1. \1-k (4... yk 
Tp(pexy, ex x ty) <p Ful >} (piq;) (ti; zs Pid;) 
k=2 i,j=l 
v8. tis — Didi 
+wl >> piaien (Sate 1) , (11.58) 
195 


Inequality (11.58) is sharp, namely it is attained by the function 


fn(s) = won(s — 1), a<s<b, (11.59) 


when n is even. 


Proof. Based on Theorem 2 of [41] and Theorem 16 of [39]. 
It follows 


Corollary 11.22 (to Theorem 11.21). It holds (0 <6 <b-—a) 


f 7 
Dy(uxy. ux X py) < >> rw S- (pias) * (tig — pig)® 


Co 


n —n\4.. [atl 
+ wi (fi 4 TG S= (piggy) [tig — Digg! 


ij=l 


CO 


1 = 
Te Y= (Pigg) tig — Pigg” 


ij=l 


eons + —n n- 
ae 8(n — 1)! yo, (oie) |ti; — pigs * \ (11.60) 


a,j=1 
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and 
Tp(uxy, bx X by) < ew S- (piqg)'—* (tig — piggy)” 
k=2 : i,j=l 
n 1 = —Tt n 
+ wi(f' 54 5 > (pig) [tig — Pigg 
* \Gj=l 
1 loo) 
—__—__ Ma) ig pag. (Kel 
Se aI 2, oa) |tis — pias (11.61) 
In particular we have 
ee gece 7 
Ty(uxy, ex X py) <wi(f',d)4 = (piggy) (ti — Dig)? 
24 i,j=l 
1f— 6 
as 3 a ts; — pigg| | + a (11.62) 
ij=l 


and 


Dy(uxy, ux X by) S na ys ltis — Digg 


ijal 


o x dS (pigs) * (ti — piggy)? \ (11.63) 


ij=l 


Proof. By Corollary 1 of [41] and Corollary 2 of [39]. 


Also we have 


Corollary 11.23 (to Theorem 11.21). Assume here that f(™ is of Lipschitz type 
of ordera, O<a<1, ie. 

wi(f™, 5) < K5%, K>0, (11.64) 
for any0 <6d<b-—a. Then 


Ty(uxy,bx X Hy) < S- yew Ds (pigy)'* (tig — Diay)* 
k=2 : 4j=1 


Co 


_ xk —n—-a N+a 
aS S~ (pigg) "Itz — piggI"** |. (11.65) 
I](a+i) \es=1 
i=l 
When n is even (11.65) is attained by f*(x) = cla —1|"*%, where c:= K/( 
i)) >0. 


(a+ 
1 


n 


a 
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Proof. Application of Corollary 2 of [41] and Corollary 3 of [39]. 


Next comes 


Corollary 11.24 (to Theorem 11.21). Assume that 


b-a> | S- (pigy) “42, | —1>0. (11.66) 
ij=l 
Then 
Co _ Co 1 
Ty(uxy, bx X py) Sor | & (pigy) "ti; — 1) > |tij a) 5 
ij=1 ij=l 
(11.67) 
Proof. Application of Corollary 3 of [41] and Corollary 4 of [39]. 
Corollary 11.25 (to Theorem 11.21). Assume that 
S > |teg — pigy| > 0. (11.68) 
ij=l 
Then 
Ty(uxy, wx X by) Swi | f’, ye ltij — Pigs S, tig — Digg| | + 5 
‘j=l ij=l 
3 we 
<5 (b-a)ur | f, S- Itiy — vial | | - (11.69) 
ij=l 
Proof. Application of Corollary 5 of [41] and Corollary 5 of [39]. 
We present 
Proposition 11.26. Let f € C([a, 5). 
i) Suppose that 
S > |tez — pigy| | > 0. (11.70) 
ij=l 
Then 
Dy(uxy, ex X by) S 2a | f, S° ti; — pigy| | ] - (11.71) 
ij=l 
ii) Let r > 0 and 
b-a>r| >> lt —pigy| | > 0. (11.72) 


ij=l 
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Then 


1 co 
yuxvixxar)<(142)er (fr do lts-pal]}. 73) 


ij=l 


Proof. By Propositions 1 of [41] and [39]. 
Also we give 

Proposition 11.27. Let f be a Lipschitz function of ordera,0<a<1, ie. there 

exists K > 0 such that 


|f(z) -— f(y)|< Klz—yl|%, all 2,y € [a, 8). (11.74) 
Then 
Dy(uxy.ux x py) <K | S> (pigg) tig — pigyl® | (11.75) 
i,j=l 


Proof. By Propositions 2 of [41] and [39]. 


Next we present some alternative type of results. We start with 


Theorem 11.28 Let f € C"*'({a,b]), n EN, then we get the representation 


n-1 foe) 
(—1)* (k+1 ( ti; ) = 
ly » WX X = TAA )( 1 ) (piqy)~* (tig — pigy)*** | +e, 
(xy, Lx X Hy) » ik DI 28 ne (pidy) “(tig — Didy) v2 
(11.76) 
where 
mie of 2 (aa) 
= ES waa) | PP -aryerM(oas} J. a.77) 
me ij=l 1 
Proof. By Theorem 12 of [41] and Theorem 17 of [39]. 
Next we estimate w2. We present 
Theorem 11.29. Let f € C"*!({a,b]), ne N. Then 
TE aSsauaes | eecocie ie 
Ba See gi) ti; — pigg|"*" 
1 (n +1)! Xe qj) | J Dp ql ) 
FFP IIa, a,b = —n n 
By = ae S- (piqy)' tag — pial” J; 
|wo| < min of ag 
~ 1 1 
and fora, B>1:=+—-=1, 
a fp 
FF IVa, a,b _ ee n 
Bg = i |S wig) tag — gy? 


n\(nB +1)1/8 fem 


(11.78) 
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Also it holds 


n-1 fore) 
1 ty 
r < (k+1) ay Gee 
f(xy, Hx X py) < dX (+! 2) an (pigs) "(tig — Digs) 
+ min(B1, Bo, Bs). (11.79) 


Proof. See Theorem 13 of [41] and Theorem 18 of [39]. 
We have 


Corollary 11.30 (to Theorem 11.29). Case of n= 1. It holds 


va If" Iloo,fa,b] = = 2 
By := — 7 S- (piq3)” |tig — Diag” J, 


ij=l 


Bor = IF"lli tay | >> Its — Digl J 
l2| < min of me (11.80) 
a 


and fora, B>1,-=+—-=1, 
a GB 


WF Nhetav) | 2 
B31 = eaten De (pid; ) MP lt, paar 


H 
Ce a oe 
Also it holds 
= {tig ; 
Ty(uxy, ux X py) < S- f Ba; (ti; — pigy)| + min(B,1, B21, B31). (11.81) 
1) 


a,j=1 


Proof. See Corollary 8 of [41] and Corollary 6 of [39]. 
We further give 


Theorem 11.31 Let f € C"*!({a,b]), n EN, then we get the representation 


= k 
Ts(uxy, ex X by) -> 6 a 3 f alee 


=0 t,j=l 


-) (Digg) "(tig — piag)*** 


—1)" . —n n 
+ Ber fd) S (piq;) " (tij — pigy)"T* | + 4, 
, t,j=l 


(11.82) 


where 


Ws = > Pigg V3 (Se 73 J; (11.83) 


t,j=1 
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with 


cone fe = 1)" (fY(s) — fF (1)) ds. (11.84) 


nl 


Proof. Based on Theorem 14 of [41] and Theorem 19 of [39]. 


Next we present estimations of wW4. 


Theorem 11.32. Let f € C"*1({a,b]), n EN, and 


1 ~ —n-1 42 
7 a igj)  |te; — paqy |” <b-a. 11.85 
0< (n +2) Xe qj) lts3 — Dig, a ( ) 


Then (11.82) is again valid and 


1 1 — Se n 
wa] < 71 (s%, SS (pia) tg — Bigg"? 


PTA) aes 
1 - =F n 
ae @an) SS (pidy) "|tig — piggl”™ | | - (11.86) 
i,j=l 


Proof. By Theorem 15 of [41] and Theorem 20 of [39]. 


Also we have 


Theorem 11.33. Let f € C"+1({a, b]), n EN, and |f@+(s)— f("t1(1)| is conver 
in s and 


1 = —n-1 n+2 : 
0< nln +2) 2 a) |tiy — pial < min(1 —a,b— 1). (11.87) 


Then (11.82) is again valid and 


Co 


1 


[bal Sor | FOF, nin +2) Dm (pia3) [tag — piggy" *? : (11.88) 
: ij=l 
The last (11.88) is attained by 
" s—l n+2 
f(s) = an ; a < S < b (11.89) 


when n is even. 


Proof. By Theorem 16 of [41] and Theorem 21 of [39]. 


When n = 1 we obtain 


Corollary 11.34 (to Theorem 11.32). Let f € C?({a,b]) and 


Co 


1 z 
0< 3 S> (pidy) 7 [tig — pig | <b-a. (11.90) 


ij=l 
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Then 
Trp(uxy, Mx X py) = S- f (==) (tiy — Pigy) 
i PiQj 
al oo = 
es rt S~ (pigg) "(tig — pigg)? | + P41, (11.91) 
‘j=l 
where 
das = >> riggs (+) , (11.92) 
fem DG 
with 
ea) ==} (s —1)(f"(s) — f"(1))ds,  @ € [a, 8}. (11.93) 
1 
Furthermore it holds 
1 _ 1 = 7 
baal Sor | f", 3 S- (pidy) 7 [tig — pial” 5 S- (pidy) ‘ti; | +1 
i,j=l tga 


Proof. See Corollary 9 of [41] and Corollary 7 of [39]. 


Also we have 


Corollary 11.35 (to Theorem 11.33). Let f € C?({a,b]), and | f(s) — f?)(1)| is 


conver in s and 


CO 


1 2 . 
0< 3 S- (p:qj) 7 \ti; — pigy|® | < min(1 — a,b — 1). (11.95) 


ij=l 
Then again (11.91) is true and 


Co 


1 re 
Parl sor] f", 3 S (pigs)? [tig — viggl® | | - (11.96) 


ij=l 


Proof. See Corollary 10 of [41] and Corollary 8 of [39]. 
The last main result of the chapter follows. 
Theorem 11.36. Let f € C"*!({a,b]), n EN. Additionally assume that 
pra) = Fy) |< Ale yl, (11.97) 
K>0,0<a<l, all x,y € [a,b] with (cx —1)\(y—1) > 0. Then 


n-1 oe) 
(—1)* (r+) (tig -k k+1 
Tys(pxy, ux X py) = hoa ——— } (pads) * (tig — Pigs) 
ser! (k+1) ei Did; 
+ DF pom (52 Grass — pia? | + de 
(n+! gj j 5 


ij=l 


(11.98) 
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Here we find that 


Kk = — FL Oe nea 
| < S- (pias) fespiGl tO |. (11.99) 
i,j=l 


aintatl 


Inequality (11.99) is attained when n is even by 


f*(z) =@a—1|"**t', x € [a,}] (11.100) 
where 
é:= — “is (11.101) 
I[m+at+1- J) 
j=0 


Proof. By Theorem 17 of [41] and Theorem 22 of [39]. 


Finally we give 


Corollary 11.37 (to Theorem 11.36). Let f € C?({a,b]). Additionally assume that 
lf" (2) — f"(y| < Kla-y|*, K>0,0<a<l, (11.102) 

all x,y € [a,b] with (c« —1)(y—1) > 0. Then 

oe 

Pidy 


Ty(uxy,ux xy) = >> f ( 


i,j=l 


) (tig — Didj) 


Yo (ig) Mtig — Pigg)? | + 4a. (11-103) 


2 = 
j=l 
Here 
Wai i= S- PiGG 3,1 (=) (11.104) 
es PQ) 
with 
03,1(2) = -| (s—1)(f"(s) — f”(1))ds, x € [a,b]. (11.105) 
1 
It holds that 
K — —l-a a+2 
lai] < (+2) S- (pigs) \tiy — Pigyl ; (11.106) 
ij=l 


Proof. See Corollary 11 of [41] and Corollary 9 of [39]. 
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Chapter 12 


Holder-Like Csiszar’s f-Divergence 
Inequalities 


In this chapter we present a set of a large variety of probabilistic inequalities re- 
garding Csiszar’s f-divergence of two probability measures. These are in parallel to 
Holder’s type inequalities and some of them are attained. This treatment is based 
on [40]. 


12.1 Background 


Here we follow [140]. 
Let f be an arbitrary convex function from (0,+00) into R which is strictly 
convex at 1. We agree with the following notational conventions: 


f(0) = lim, F(u), 


0. fF (3) =, (12.1) 


of (5) = Jim ef (2) =a lim fu) (0 <a < +00). 


E u>+oo UW 
Let (X,.A, 2) be an arbitrary measure space with \ being a finite or o-finite mea- 
sure. Let also ju, fg probability measures on X such that f41, w2 << » (absolutely 
continuous). 
Denote the Radon—Nikodym derivatives (densities) of 4; with respect to by 
pi(x): 


pi(x) = ao ~ wo, 2 
The quantity (see [140]). 
Pyluasna) = [pales (AS) acer) (12.2) 


is called the f-divergence of the probability measures jz; and fg. The function f is 
called the base function. By Lemma 1.1 of [140] T’¢(11, w2) is always well-defined 
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and T'(441, 2) > f(1) with equality only for 4; = ue. Also from [140] it derives 
that T'(41, U2) does not depend on the choice of \. When assuming f(1) = 0 then 
I’, can be considered as the most general measure of difference between probability 
measures. 

The Csiszar’s f-divergence If incorporated most of special cases of probabil- 
ity measure distances, e.g. the variation distance, y?-divergence, information for 
discrimination or generalized entropy, information gain, mutual information, mean 
square contingency, etc. ['y has tremendous applications to almost all applied sci- 
ences where stochastics enters. For more see [37], [42], [43], [41], [139], [140], [141], 
[152]. 

In this chapter all the base functions f appearing in the function I’y are assumed 
to have all the above properties of f. Notice that y(t, u2) < Pypj(u1, ue). For 
arbitrary functions f, g we notice that 


flog=(fog)’, reER and |fogl=|flog. 
Clearly one can have widely products of (strictly) convex functions to be (strictly) 
convex. 


12.2 Results 


We start with 
Theorem 12.1. Let p,q > 1 such that ‘ + ; =1. Then 


1 1 
TiApiniey < Ce dase) Cigar) (12.3) 


Equality in (12.3) is true when |fi|? = cl fo|% for c > 0. More precisely, inequality 
(12.3) holds as equality iff there are a,b € R not both zero such that 


po ((alfl? ein") (2)) =o, ac. 


Proof. Here we use the Holder’s inequality. We see that 


Piystinisia) = f palfal (2) «Lol (2) aa 
-[ (ern) (irm(2)) 
<(fue)a)" (fie) 


= (Vip.jo(nas 2)!” (Cy paja(uas na)) 


Corollary 12.2. It holds 


1/2 1/2 
Vy fy foi (1, Ha) S (T y2(u41, #2) : (Lp (141, 2) = (12.4) 
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Proof. By (12.3) setting p = q =2. 


We continue with a generalization of Theorem 12.1. 


Theorem 12.3. Let a1, a2,...,4n >1,n€N: }) +=1. Then 


i=l 


YH ‘| (1, H2) < [IC Dy p,j20 (41, #2) gee (12.5) 
i=1 i=1 


Proof. Here we use the generalized Hélder’s inequality, see p. 186, [235] and 


p. 200, [214]. We observe that 
P1 fas |p 
T) a ; i{ — || dA= dy 
pie hee f2) = i P2 IL (2 :) I IT(o! £(%))) 


n 1/a; n 
<I] (mle “(2 aa) = TI] Pipa er oa) 
i=1 


It follows the counterpart of Theorem 12.1. 


Theorem 12.4. Let 0 < p< 1 and q < 0 such that “ aS =1. We suppose that 
pz >0 ae. [A]. Then we have 


1/ 
Dips fol (Has M2) > (Pipyje(ua, ua)) / PD | poje (peas M2) '/4 (12.6) 


unless 1) p,)a(u1, ¥2) = 0. Inequality (12.6) holds as an equality iff there exist A, B € 


R, (A+ B40) such that 
1/p-1 
aE) -2 (Sy) + ob 


Proof. It is based on H6lder’s inequality for 0 < p< 1, see p. 191 of [214]. Indeed 


we have 
P Pp 
TD) fy fo| (Ht, M2) a p2 n(2) h(2)|aa 
x P2 P2 


=f (e¥rinl(2)) (eal) 
: (/,inr(S)@) G2 pal fal! (2 )a) ‘ 


= (Tipit) ” Oetandy ie 


Ap!” 


Next we have 


Proposition 12.5. Let fo(Be ) € Lxo(X). Then 


Pp 
Pippy (oasna) <P p(n) + (eossup fal(24) ). (12.7) 
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Proof. We notice that 


Pippin) = f pela fl(%)ar= fh (oalal(@ 2)) bral “Jay 
«(i 3)-m(0(8) 
= Tp.) (41, 2) - esssup (isa (2)) ; 


In the following we use from [185], p. 120, see also [309], 


Theorem 12.6. Let f and g be positive, measurable functions on X for a measure 
space (X,F,). LetO<t<r<m<oo. Then 
(i) The Rogers inequality 


t 


( a furan) < ( . fat) ( ‘a fa" dn) (12.8) 


provided that the integrals on the right are finite. 
(ii) Form > 1, the “historical Hélder’s inequality” 


([ tom) < (fran) (19 au) 412.9) 


provided that the integrals on the right are finite. 
Using the last results we present the final result of the chapter. 


Theorem 12.7. Let fi, fo with |fil, | fol > 0 and po(x) > 0 ae. [A]. LetO<t< 
r<m<oo. Then it holds 


i) 
(Lys iter (oa 2)” S (Lipa pate (aso) (Pipa pei (eta a)” (12.10) 


ii) Form > 1, 
m m—-1 
(Visa ipei(er be) S Cpa (Has 2) ip ipapn (Ha wa): (12.11) 


Proof. i) By applying (12.8) we obtain 


(Typatpein(east2))” = @ ah (2) ar(2)ax) 
S @ pal fil (2)ia(2)a) 
(fms Yipam(2)ar) 


= (Li pat pole (oa, 42)” (Py put pal (eta #2) 
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ii) By applying (12.9) we derive 


(Lift tyet(se1, #2)” = (f mani( )a(& ») , 
(f vaini(Z)ar) (fl voit (2) isain(2)aa) 


—-1 


< 


= (Cp. H2)) Typ pete (es 2). 


Note 12.8. Let X be a finite or countable discrete set, A be its power set P(X) 
and A has mass 1 for each x € X, then I’ in (12.2) becomes a finite or infinite sum, 
respectively. As a result all the above presented integral inequalities are discretized 
and become summation inequalities. 
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Chapter 13 


Csiszar’s Discrimination and Ostrowski 
Inequalities via Euler-type and Fink 
Identities 


The Csiszar’s discrimination is the most essential and general measure for the com- 
parison between two probability measures. In this chapter we provide probabilistic 
representation formulae for it via a generalized Euler-type identity and Fink iden- 
tity. Then we give a large variety of tight estimates for their remainders in various 
cases. We finish with some generalized Ostrowski type inequalities involving || - ||, 
p > 1, using the Euler-type identity. This treatment follows [46]. 


13.1 Background 


(I) Throughout Part A we use the following. 

Let f be a convex function from (0,-+00) into R which is strictly convex at 1 
with f(1) = 0. Let (X,A,A) be a measure space, where is a finite or a o-finite 
measure on (X,.A). And let j11, 42 be two probability measures on (X, A) such that 
ba <A, 2 < 2 (absolutely continuous), e.g. A = pi + pa. 

Denote by p = gn q= dhe the (densities) Radon—Nikodym derivatives of s11, 
Lg with respect to A. Here we suppose that 


I2eet Ss ae. on X anda<1<b. 
qd 


The quantity 


Pytuasna) = f acai (A) ance (13.1) 


was introduced by I. Csiszar in 1967, see [140], and is called f-divergence of the 
probability measures 4, and p2. By Lemma 1.1 of [140], the integral (13.1) is 
well-defined and T'y(41, 2) > 0 with equality only when 4; = pe. Furthermore 
T'¢(4#41, #2) does not depend on the choice of ». The concept of f-divergence was in- 
troduced first in [139] as a generalization of Kullback’s “information for discrimina- 
tion” or I-divergence (generalized entropy) [244], [243] and of Rényi’s “information 
gain” (I-divergence of order a) [306]. In fact the I-divergence of order 1 equals 


Du logs u( fA, [i2). 
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The choice f(w) = (uw — 1)? produces again a known measure of difference of 
distributions that is called y?-divergence. Of course the total variation distance 
|H1 — Hal = Sx p(x) — a(x) |dA(z) is equal to P),,1)(H1, #2). 

Here by assuming f(1) = 0 we can consider T'y(j11, ju2), the f-divergence as a 
measure of the difference between the probability measures j41, 2. The f-divergence 
is in general asymmetric in 41 and pg. But since f is convex and strictly convex at 
1 so is 


P*(u) = uf (+) (13.2) 
and as in [140] we find 
Dp (Ha, ei) =D yp (11, Ha). (13.3) 


In Information Theory and Statistics many other divergences are used which are 
special cases of the above general Csiszar f-divergence, e.g. Hellinger distance Dy, 
a-divergence D,, Bhattacharyya distance Dg, Harmonic distance Dy, Jeffrey’s 
distance D,, triangular discrimination Da, for all these see, e.g. [102], [152]. The 
problem of finding and estimating the proper distance (or difference or discrim- 
ination) of two probability distributions is one of the major ones in Probability 
Theory. 

The above f-divergence measures in their various forms have been also applied 
to Anthropology, Genetics, Finance, Economics, Political Science, Biology, Approx- 
imation of Probability distributions, Signal Processing and Pattern Recognition. A 
great inspiration for this chapter has been the very important monograph on the 
topic by S. Dragomir [152]. 

In Part A we find representations and estimates for I’ (11, 42) via the generalized 
Euler type and Fink identities. 

We use here the sequence {By(t), k > 0} of Bernoulli polynomials which is 
uniquely determined by the following identities: 


Bi(t) =kBri(t), k>1, Bo(t)=1 
and 
B,(¢+1)— B.() = kt? *, k> 0. 


The values By = B,(0), k > 0 are the known Bernoulli numbers. We need to 
mention 


1 1 
Bot)=1, Bil)=t—5, Ba(t) =t —t+e, 
B3(t) =8 - 3241) Bat) = t* -2e +0? - ae 
i ae a 30’ 
5 5 1 5 ee A 
Beir SP fo = Sh. and Bet) =f =—3F + 6 — 4+. 
5(t) 5 3 gr ae 6(t) 3 75 a1 D 


Let {Bg(t),k > 0} be a sequence of periodic functions of period 1, related to 
Bernoulli polynomials as 


Bi(t)=B,(t), O<t<1, Be(t+1)=Bix(t), teR. 
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We have that Bj (t) = 1, By is a discontinuous function with a jump of —1 at each 
integer, while BZ, k > 2 are continuous functions. Notice that B,(0) = B,(1) = Bx, 
k>2. 

We use the following result, see [53], [143]. 


Theorem 13.1. Let any a,b € R, f: [a,b] > R be such that f"-), n> 1, is a 
continuous function and f(x) exists and is finite for all but a countable set of x 
in (a,b) and that f(™ € Ly([a,b]). Then for every x € [a,b] we have 


. 2 (ob — a) LG 
Ne pag f, Home OOS ae FSW - FCW 


4 ae =. (G2) = Bs (F=2) | sree (13.4) 


The sum in (13.4) when n = 1 is zero. If f("~" is absolutely continuous then (13.4) 
is valid again. 


We need also 


Theorem 13.2 (see [195]). Let any a,b € R, f: [a,b] > R, n > 1, f"-) is 
absolutely continuous on [a,b]. Then 


b n-1 
N= sry ff fod- SAG) 


z : n-1 (n) 
cs aaeca | (c—t)” k(t, x) f\™ (t)dt, (13.5) 
where 
n— (K-1)(q\(~ — q)k — fR-D ee 
and 
t—a, a<t<a<b, 
Me) a a<a<t<b. (13.7) 


(II) In Part B we give L, form, 1 < p < ow, Ostrowski type inequalities via 
the generalized Euler type identity (13.4). We mention as inspiration writing this 
chapter the famous Ostrowski inequality [277]: 


tcf oe boss 


where f € C’([a,b]), x € [a,b], which is a sharp inequality, see [21]. 
Other motivations come from [53], [82], [21], [143]. Here our inequalities involve 
higher order derivatives of f. 


(b—a)l| f'lloo, 
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13.2 Main Results 


Part A 
We give 


Theorem 13.3. Let f: [a,b] —~ R as in Section 13.1 (I) and be such that f"-, 
n> 1, is a continuous function and f(x) exists and is finite for all but a countable 
set of x in (a,b) and that f™ € Ly, ({a,b]), for 1 <p; < 00. Then 


b 
Py(msia) = fe f(a (13.8) 
n-1 P 
(b= a) ( e(&-1) (k- ce 
ee ee Os a) { aBa( —* )ar+ Ro, 
where 
Ae: Pose RP _ 
x fol (a) -n( rs) 
One has 


bas n—1 
n! xX [a,b] 


Proof. In (13.4), set «= z multiply it by q and integrate against 2. 


ria ee” Vee 
q * q n 


(13.10) 


Next we present related estimates to I's. 


Theorem 13.4. All assumptions as in Theorem 18.3. Then 


(i) 


ir (fal f 


aan) (Fo lass 819) 


( yn 1 p_ q1 1/q 
a, h 
ee (/ af By (t) a. (; i) aa) lov 
ni xX 0 b-—a 
1 1 
forpi,a >1:—+—=1, (13.12) 
Pl 71 


a) fla, (13.13) 
co, [0,1] 
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if f™ € Ly (a, 4). 
Proof. We are using (13.10). Part (iii) is obvious by [143], p. 347. For Part (i) we 


2G | ee 
q _ pxl 4 (n) 
(7) (7) at) an) E 


and by the change of variable method the claim is proved. 
For part (ii) we apply Hoélder’s inequality on (13.10) and we have 


ir fal flan 


Ping \ 3/2 a 
- wij a) ar) |F| 
n b—a Pl? 


again by change of variable claim is established. 


notice that 


(13.10) (h — q)r—1 
[Rn] < ae (/ (| 
x [a,b] 


ni 


Next we derive 
Theorem 13.5. Let f as in Section 13.1, (I) such that f°—) is absolutely con- 


tinuous function on [a,b], n > 1 and that f(™ € Ly, ({a,b]), where 1 < pi < oo. 


Then 
n a ii gene 
Prise) = Gf sae — S (5) (13.14) 
, ji Sx *(p - = FOV) fxg *w- ne) Go, 
where 
_ 1 2-—n : = n-1 Pp (n) 
Gp i= Dia) [a (/ (p — tq)” *k (« 2) f oat) dX, (13.15) 


and 


1 2-—n ; n-1 
el<o—e—a fe (/ |p — tq| 


Proof. In (13.5) set x = f multiply it by q and integrate against A. 


k (:.2) cou dX. (13.16) 


We give the estimates 
Theorem 13.6. All suppositions as in Theorem 13.5. Then 


() 
1 —n n+1 n+1 n 
ol a (fo ea + (on = yar) FM (08.17) 


if f™ € Loo((a, b]), 
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(i) 


IG [se (B(qa +1, a(n —1) +1))/@ 


(n — 1)!(b— a) 


—n-+ n in V/q n 
x (a 7 {(p—ag)®"+1 + (bg — pyre yr/4 aa) ‘pss 


(13.18) 


for p1,% >1: on Se x = 1, if f™ € Lp, ({a, ]), 
and (iii) 


al < peas (fa (m(E-e 0-2) a), 529 


if f™ © Li((a, 0). 
Proof. (i) We have by (13.16) that 


Gal < Sees (/. ea Ip — tq|" 


if f™ € Loo([a, 6). 
Here 


i (12)|a)aa) LF loo 


Then by [338], p. 256 we obtain 


b 
yp ain-1 Dp ees: = n+1 _ »yntl 
[iw ear| (02) far = SP (aay + (on 9), 


which proves the claim. 
(ii) Again by (13.16) we derive 


1 —n ; n- 
IGn| < Ge =a) @ ¢ (/ |p — tq|% 1) 
na 1/qi 
p (n) 


for pi,g1 > 1: +2 =1, if f™ € Lp,([a,0]). Again by [338], p. 256 we find 


b p q1 V/n 
[e-em |e (2) dt 
a qd 


= g(a) (Ba + 1,au(n— 1) +1) ™ {(p— ag) e"4 + (bq — pynrttpy™, 


proving the claim. 
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(iii) From (13.16) we obtain 


n-1 


1 2-—n 
IGn| < elo). [a \|p — tq| oo [asbl 


K(«.2) 
qd 


K(«.2) -|[¢-4| =max (2 —a,0-2), 
qd sate qd oo, [a,b] qd qd 


The last establishes the claim. 


ad} fl, 
oo, [a,b] 


Next we study some important cases. For that we need 
Theorem 13.7. Let any a,b€ R,a< a <b, call A= =: It holds 
(i) 


h(\):= [ |B (t) — By(\)|dt = it < * Vx € [a,b], (13.20) 


with equality at x = a,b, 
(ii) ({143], p. 343) 


1 
3 a 1 1 
(3) = |Bo(t) — Ba(H)|dt = 55%(a) — (2) + <z, Vr € [a,b], (13.21) 
0 
with equality at x = a,b, where 
Jc -— 
6(z) := ——+,  ré[a,}], (13.22) 
b-a 
(iii) ([13]) 
1 
I3(A) := | |Bs(t) — Bs(A)|dt = 
0 
Sige ee at Bee. i eh ae & 3- V3 
pre Se aN Sy. “Ae |G | 
a ee d ~ (3-Vv3 1 
P14 53, 4 P54 3 pees i 
ty — 2ty 5 5 3A° — 2X aed ie ( r 3) 
7 ae (13.23) 
t 3+, * ~ nN ~ 1 34+Vv3 
244 3, % % 4,53, 52 _ 4 + 
ty — 2t5 5 5 A” +A 5? rAE (5. r [ 
eee ee ee 2 pee De “ey TEAS 
a fief = 2 ae eS 1 
pees a Tg 3A" + 2X 5? AE rama 
where 
Bx eo aan, ae 
ge RA ees =~ 4.3) _ 332 
t= (agra a 3A — 32, 
ee ee eee 
pal Es Sul Sep ey. 948 
t= 7-5 t5yqt3\-3, (13.24) 
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and 
~ 3 
noe, (13.25) 
with equality at A= 3eV3 
(iv) ([53]) 
1 
14) := f |Ba(t) — Ba(A)lat 
<5 ; ; ; 
“ 7 18 Pte, 0<\<3, 
= (13.26) 
u x4 2 ~ 1 2 
ee gi? — ws +3V2-—, 4£<X<1, 
10 
with 
~ 1 
<— : 
I4(A) < 30° (13.27) 
attained at \ = 0,1, i.e. when x =a,b, and 
(v) ([143]) for n > 1 we have 
“ 1 7 1 7 1/2 
(3) = f Balt) — Budlat < ( f (Balt) — By A))Pat) 
0 0 
_ , | <n? 2(3 
= Gry |Bon| + B2(A). (13.28) 


Consequently by (13.11) and Theorem 13.7 we derive the precise estimates. 


Theorem 13.8. All assumptions as in Theorem 13.3 with f™ € Lo(la,b]),n > 1. 
Here 


Ma 2a 
een _ ae _ 4 
A:= A(x) = iG pag! rE X, (13.29) 
ie. X€ [0,1], ae. on X. Then 
b—a)” ie % 
[Ral <P (fatal dr) [a (13.30) 
Nn: x 
and 
b-—a 
al < Sp. (13.31) 
b— a)? 
ira] < PI pe, (13.32) 
/3(b — a)? 
[Rs] < POH pry, (13.33) 
and 
b—a)* 
[Ral < COST 1 FO... (13.34) 
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We have also by (13.12) and (13.28) that 
Corollary 13.9. All assumptions as in Theorem 13.3 with f\™ € Lo([a,b]). Then 


(b— a)” (n!)? 2 rake (n) 
nl <P a 4] Go lBonl + B2(2—) J aa} FI, 2 . 


(13.35) 
We need 


Note 13.10. From [143], p. 347 for r € N, 2 € [a,b] we have that 


wa 


b-a 


) | = (L987) Bl 2? Ba, ~ Bae (F=*) (13.36) 


max 
te [0,1] 


And from [143], p. 348 we obtain 


u—a 
Bor4i(t) — Bor+i ( 5 ) | 
—a 


Boy (t) — Bop ( 


max 
te [0,1] 
2(2r + 1)! r-—a 
Ss ST + [Brn (F=*)| (13.37) 
along with 
L-a 1 a+b 
Bi (t)-B =x - 13. 
ag [Put (SS) |= a+ fe AS ae 


Using (13.13) and Note 13.10 we find 


Theorem 13.11. All assumptions as in Theorem 13.3 with f™ € Ly({a, 0), r,n > 
1. Then 


Ril < {5 + [ lp- eet lan} Iv'lh. (13.39) 


when n = 2r we have 
(b es ayer 
(2r)! 


+ fa 
x 


and for n = 2r+1 we obtain 


|Ro,| < 


{1 — 2°7°)| Bo, | 


P_q 
q 
—a 


27?" Bop — Bay(4 


) 


aah, (13.40) 


|Ror4il < 


(b— a)?" 2(2r +1)! 


fa 
+ [ a)Bmn(4 ) 
x —-a 


aah sore, (13.41) 
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Part B 


Remark 13.12. Let f be as in Theorem 13.1, we denote 
b 


An(2) = f(x) - —s f(t)dt cae 
n-1 mao k-1 rae 
_ S- On 5, (2) = f-Va)], ea tinst. 
k=1 : 


We have by (13.4) that 


Se 


Let here 1 < p < oo, we see by Holder’s inequality that 


[An(a)| << COO (/ B (F=*) ~ a: (F=) ) Ol. 


(13.44) 
where q > 1 such that aa = 1, given that f™ € L,([a,b]). By change of variable 
we find that 


(*) = Cs a ( | ‘|B, - B, (7 ) Pat) if fll. (48.48) 


We have established the generalized Ostrowski type inequality. 
Theorem 13.13. Let any a,b € R, f: [a,b] > R be such that f"-), n > 1, is 
a continuous function and f(x) exists and is finite for all but a countable set of 
x in (a,b) and that f(™ € L,([a,]), 1 < p,q < oo: ao = 1. Then for every 
x € [a,b] we have 
1/q 


nto} s C=O ( Pl, ~ a ($2) [at Fle. (13.46) 


The last theorem appeared first in [143] as Theorem 9 there, but with missing 
two of the assumptions only by assuming f‘”) € L,((a, ))). 

We give 
Corollary 13.14. Let any a,b € R, f: [a,b] — R be such that f"-), n> 1, is a 
continuous function and f(x) exists and is finite for all but a countable set of x 
in (a,b) and that f(™ € La({a,b]). Then for any x € [a,b] we have 


ani < a — Bon] + B3(F— 2) ia (13.47) 


n! (2 


Proof. By Theorem 13.13, p = 2 and that 


[ (2. -B,(2=2)) ae HO |Ban| + BE (=). (13.48) 


see [143], p. 352. 
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Here again, (13.47) appeared first in [143], p. 351, as Corollary 5 there, erro- 
neously under the only assumption f‘”) € L2({a, b]). 
Remark 13.15. We find the trapezoidal formula 


Aaa) = A4(s) = (FOE AO) C8 ype f soa, (13.49) 


and the midpoint formula 


a.(**) = (=) Es veo) (f(b) — f'(a)) — — [ sou (13.50) 


Furthermore we obtain the trapezoidal formula 


As(a) = As(b) = Ao(a) = Ag(o) = OFM) C—O ney _ pray) 


aye b 
ona) (f(b) — f’"(a)) =a f(t)dt. (13.51) 


We also derive the midpoint formula 


as(S$*) = a6(*2*) = #(S*) + SP rw - ro) 
— a pm) r"(@) 


Using Mathematica 4 and (13.47) we establish 


(13.52) 


Theorem 13.16. Let any a,b € R, f: [a,b] = R be such that f®) is a continuous 
function and f(a) exists and is finite for all but a countable set of x in (a,b) and 
that f € Lo({a,b]). Then 


Safa) < Ph, (13.53) 
ae( S| < yee 1 (13.54) 


We continue with 


Theorem 13.17. Let any a,b€R, f: [a,b] — R be such that f“ is a continuous 
function and f(x) exists and is finite for all but a countable set of x in (a,b) and 
that f® € Lo([a,b]). Then 


IAs(a) < C=O | p09, (13.55) 
aN TAA ISTO = 
as(*)]s — ae Fle (13.56) 


We finish with 
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Theorem 13.18. Let any a,b €R, f: [a,b] = R be such that f©) is a continuous 
function and f(x) exists and is finite for all but a countable set of x in (a,b) and 
that f© € Lo({a,b]). Then 


1 101 


6 eh Sg)? (FO? 13.57 
Saa0 30030 a)? || fF’ Ile, ( ) 


|Ac(a) 
and 


a+b 1 7081 5.54) ¢(6) 
——— i eR a: 
a0( 2 )|< sam aig 0 NF Ile (13.58) 


Chapter 14 


Taylor-Widder Representations and 
Gruss, Means, Ostrowski and Csiszar’s 
Inequalities 


In this chapter based on the very general Taylor—-Widder formula several represen- 
tation formulae are developed. Applying these are established very general inequal- 
ities of types: Grtiss, comparison of means,Ostrowski, Csiszar f-divergence. The 
approximations involve L, norms, any 1 < p < oo. This treatment follows [56]. 


14.1 Introduction 


The article that motivates to most this chapter is of D. Widder (1928), see [339], 
where he generalizes the Taylor formula and series, see also Section 14.2. Based on 
that approach we give several representation formulae for the integral of a function f 
over [a,b] C R, and the most important of these involve also f(x), for any x € [a,b], 
see Theorems 14.6, 14.8, 14.9, 14.10 and Corollary 14.12. Then being motivated 
by the works of Ostrowski [277], Griiss [205], of the author [48] and Csiszar [140] 
we give our related generalized results. We would like to mention the following 
inspiring results. 


Theorem 14.1 (Ostrowski, 1938, [277]). Let f: [a,b] — R be continuous on [a, b] 
and differentiable on (a,b) whose derivative f’: (a,b) — R is bounded on (a,b), t.e 
lf’llo = sup |f’()| < +00. Then 

tE (a,b) 


b 
— | fat He) < 


1, @-8)"] 
~o | (b—a)llf'|loo, 


for any x € [a,b]. The constant + is the best possible. 


Theorem 14.2 (Griiss, 1935, [205]). Let f,g integrable functions from [a,b] into 
R, such that m < f(a) < M, p< g(x) <o, for all x € [a,b], where m, M, p,o € R. 
Then 


= —f Fa) g(a)de — (= [ reve) (= [ sc2yar) 


173 


174 PROBABILISTIC INEQUALITIES 


So our generalizations here are over an extended complete Tschebyshev system 
{ui} € C”*1([a, b]), nm > 0, see Theorems 14.3, 14.4, and [223]. Thus our Os- 
trowski type general inequality is presented in Theorem 14.14 and is sharp. 

Next we generalize Griiss inequality in Theorems 14.15, 14.16, 14.17 and Corol- 
lary 14.18. It follows the generalized comparison of integral averages in Theorem 
14.19, where we compare the Riemann integral to arbitrary general integrals involv- 
ing finite measures. We finish with generalized representation and estimates for the 
Csiszar f-divergence, see Theorems 14.21, 14.22. The last is the most general and 
best measure for the difference between probability measures. 


14.2 Background 


The following are taken from [339]. Let f, uo, ui,.--,Un € C"*({a, b]), n > 0, and 
the Wronskians 


uo(x) ur(a Uji (x 
up(z) uy (2) uj (2) 
Wi(c) = Wluo(2),w(2),-..,us(z)} =|”, ; 
up? (a) ula) uf? (a) 
= Oped marsy Ms 
Suppose W;(a) > 0 over [a,b]. Clearly then 


Wi (x W;(x)Wj_2(x 
$o(x) := Wo(x) = uo(z), o1(x) = TIE ile) = See 
i = 2,3,...,n are positive on [a, b]. 
For i > 0, the linear differentiable operator of order i: 
Lafl0) = Ts Fete 
Lof(x) := f(x), Vax € [a,b]. 
Then for i=1,...,2+1 we have 
a a ne ae eae f(a) 


d 


Consider also 


uo(t) ur(t) Uji t) 
uo(t) u(t) uj(t) 
gle) | : : 
Oh gq uy W) 
Uo(x) ur(x) U(x 


i= 1,2,...,n; go(z,t) := cata Va,t € [a,b]. 
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Note that g;(x,t) as a function of x is a linear combination of uo(z), 
ui(a),...,uz(a) and it holds 


G(X, t) 


= a a [ $1(1) ie a bi-1(Xi-1) i bi (x;)dx;daj_1--- dry 


1 # i 
=sp~aw | go(s) ++ bi(s)gi-1(2, 8)ds, (ec—al ee er 7 


Example [339]. The sets 
{1,2,27,...,2"}, {1,sin 2, —cos2, — sin 2x, cos2z,...,(—1)""'sin na, (—1)" cosnx} 


fulfill the above theory. 
We mention 
Theorem 14.3 (Karlin and Studden (1966), see p. 376, [223]). Let uo, u1,..-,Un € 


C”([a,b]), nm > 0. Then {u;}?_9 is an extended complete Tschebyshev system on 
(a, b] aff Wi(x) > 0 on [a,b], 7=0,1,...,n. 


We also mention 
Theorem 14.4 (D. Widder, p. 138, [339]). Let the functions f(x), uo(x), 


ur(x),...,Un(x) € C"*1([a, b]), and the Wronskians Wo(x), Wi(x),...,Wn(x) > 0 
on [a,b], x € [a,b]. Then for t € [a,b] we have 


Fla) = A + La f(tgu(ast) +++ + Ln flOgn(t,0) + Ral), 


where 
Ry(2) =| Gn(x, 8) Ln4if(s)ds. 


E.g. one could take uo(z) =c > 0. If u(x) = z*, i= 0,1,...,n, defined on [a,b], 
then 


(x — t)' 
il 


Lif(t) =f and gi(x,t) = , té [a,b]. 


So under the assumptions of Theorem 14.4 it holds 


x 


4S Lifton) + fO gnle.QLnnr flat, vey € [a8 
= (14.1) 


uo(a 
uoly 


f(x) = fy) 
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14.3. Main Results 


Define the Kernel 


t-a, a<t< b; 
K(t, 2) := oc oe (14.2) 
Integrating (14.1) with respect to y we find 


b 
F(2\(b—a) = ua(e) f AY 


b 
Lif (y)gi(a, y)dy 


=1 a 


+f (fo au(esthner sO) dy, Vxel[a,b]. (14.3) 


We notice that (see also [195]) 
[aufa- [afar f ay fa 
y 
y x t b b 
-faf y- fw a= f at f ay— fat | dy. (14.4) 
a a x x t 


By letting * := g,(x,t)LZn+if(t) we better notice that 


[Uf cahan= [Of sarhare [ (fae) 
-[(L-a)-£ a) 
a oe (La)e 
= * (t —a)dt+ *« (t— = * k(t, x) dt. : 
| (t — a)dt : (t — b)dt | k(t,x)dt. (14.5) 


Above we have that 


and 
rsysb x<t<b 
ey) a 
So we obtain 


Lemma 14.5. It holds 
b x b 
i) (/ an(zst) Env f(a dy = / n(x, t) Enzi f (E)K (Et, x) dt. (14.7) 
a y a 


We give the general representation result, see also [195], as a special case. 
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Theorem 14.6. Let f(x), uo(x), ui(z),...,Un(x) € C"*1([a, b]), n > 0, the Wron- 
skians W;(x) > 0,i1=0,1,...,n, over [a, ‘i x € [a,b]. Then 


b 
f(a) = 2) : IO y+ Alef Lif ty (sah) 


vee! 
b—a 
When uo(x) = c > 0, then 


b n b 
— | t+ (>: / bsdatesd 


fe 1 
b-a 


/ gn(2,t)Lnsif(t)K(t,x)dt, Ve € [a, 8). (14.8) 


; Gn(@, t) Engi f(t) K(t,x)dt, Va € [a,b]. (14.9) 


Proof. Based on (14.3) and Lemma 14.5. 
We also need 


Lemma 14.7. Let f, {ui}%p € C”((a, b]), and W(x) > 0, i =0,1,...,n, n > 0, 
over [a,b]. Then 


n-1 


ed Lif(y)gi(z, y)dy = So(n- ome — Lif (a)gi4i (2, @)| 


i=1°% i=0 
LfwWgin(ty)\ wer ° £Y) ig: 
+ f (Pian ae +nuo(e) {Ae ay, ve {a,d). (14.10) 
In case of uo(x) = c > 0 then we have 
n b n-1 
ay, Lif (y)gi(x, y)dy = So (n- Hf [Lif (b)gini(a, b) — Li f(a)gipi(x,a)] 
41, i=0 
” (Lif y)girr(@,y) \ 4) ‘ 
+f (Aen) seston} +n fi fy)dy, Yee fat}. (14.11) 


Proof. By [339] for i =0,1,...,n—1, we get 


(Tove )) avalon) -[ (Tee )) aera 
(To ()nostean) =- (Tle ate) 


i+1 i+1 
—d (II x(y) ) aunt X,Y )) - (Il ony ) gilx, y)d (14.12) 
k=0 


Thus 


and 
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Consequently for 7 = 0,1,...,2— 1 we have 


b b t+1 
Ko) = f bifdolen)ey = | Ae 2 (Toni )) enna 
: y)) 


b i+1 
(14.12) -| ae (IL only )) ava x ) 


II oxy) 
k=0 


b b fitl ; 
= —Lifly)giri(@,y) +f (I ony ) a x,y)d pee 
i i dx (y) 
(14.13) 
So we find for i= 0,1,...,n—1 that 
Ji(x) = Lif(a)gisi(x, a) — Li f(b) gi+1 (x, b) + 4;(2), (14.14) 
where 
b itl : 
cee (I dey ) x,y)d oA) (14.15) 
7 IT ¢x(y) 
k=0 
Again from [339], for i = 0,1,...,n—1, we have 
Lif) @ 1 d@ 1 d@ di 4 fy) 
: fee eae (14.16) 
1 dy i dy a dy d d 
Tl daty) y $i1(y) dy 6i-2(y) dy — dy ox(y) dy bo(y) 
and 
Lif) @ 1 @ ito dodaiid fy (14.17) 
a dy d;(y) dy dy— d d d . ; 
TI xt y dily) dy Gi-1(y) dy dy o1(y) dy do(y) 
That is, for i =0,1,...,2—1 we derive 
Lisifly) — @ | Lifly) biti) 
i dy i+1 
IT ¢x(y) IT ¢«(y) 
k=0 k=0 
d LD; L; / 
=| 5 (GM) fons | SO Yow. cars) 
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Hence we find 


Bo a ew) 
d ( Lif(y) eee re il 
dy (= ) 7 bi+1(Y) as 
II ¢«(y) 
k=0 
for? =0,1,...,n—1. That is 
d ou - pane) _ pate oan di GEO Teale 
Il ¢x(y) IT ¢x(y) IT ¢«(y) 
k=0 k=0 =0 
(14.20) 
Applying (14.20) into (14.15) we get 
b / 
ace) = J gisatosu) [Lees Fly) — Las SE cy 
b b 
=f slentintindy— [ tFe)aeale yy 22. 
That is, we proved that 
b ; 
6;(x) = Jizi(a) — / Liftwgiea (ey) SA) , fori=0,1,...,n—1. (14.21) 
Therefore it holds 
b ; 
K(0) = Eafla)gisa(e.a)—Lif gees e.0)+Sia(0)— fLif(y)giea(ey) Sw , 
a bi+1(y) 
and 
Jisa(a) = Si(x) + [Lif (0) 9:41 (2, b) — Lif (@gini (2, 0)] 
b , . 
+ flO sl) 4,269, (14.22) 
for? =0,1,...,n—1, with 
b 
Jo(x) = uo(2) f a dy. (14.23) 
Thus we obtain 
n-1 n-1 
So (n= t)(Jiss(2) — Ji(x)) = SO(n - 4) [Lif (b) gia (@, ’) — Lif (a)gita(2, 0)] 
i=0 i=0 
= f° (Lif ygini(ey) 
y(n re | Bone cee ) dpi41(y). 


(14.24) 
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Clearly, finally we have 


So (n= i)(Jesa(@) — K(@)) = Yo Ia) — ndo(a), (14.25) 
1=0 w=1 


proving the claim. 


We give 


Theorem 14.8. Let f,{ui}?) € C”([a, 6]), and W;(x) > 0, i=0,1,...,n,n > 0, 
over [a,b]. Then 


b n b 
w(x) [LH ay = 2 > / Lslodate.ny) 


a UO (y) 
n-1 


+ » (1 = *) [Lif (a) gi¢1(x, a) — Li f(b) gi41(2, 6)| 


S12) fi (Rome) oy sand, vee fot 


1=0 
(14.26) 
When uo(x) = c > 0, then we obtain the representation 
b nr pb 
| fay =- bs / bso 
a i=1 a 
n—-1 . 
+ DO (1- 5) fest@aina(esa) ~ bFO)giva 8) 
i=0 
= i\ [? (Lify)gisi(a,y)\ 
EO a) LEGG) Menton Yeetot 
(14.27) 


Proof. By Lemma 14.7. 


We give also the alternative result 
Theorem 14.9. Let f,{ui}tp € C”([a,b]), n > 0, x € [a,b], all Wi(x) > 0 on 
[a,b], 2 =0,1,...,n. Then 


b b 
uo(z) : fay = [ Lnf(y)gn(«, y)dy 


ts x [Lif (a) gis (@, a) — Lif (b)9i41(2, b)] 
1=0 
_ i Lif (y)gi+1(@, ¥) 


bea) sista} Va € [a,b]. (14.28) 
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When uo(x) = c > 0, then 
b b 
[frau = fi tn fludgn(e. way 


n-1 


+> [Lif (a)gi+1(@, 0) — Li f(b) gi+i(, 6)] 


i=0 


> Lif (ygiui(2,y) 
“ : ELM ses 2D 4, onan} Va € [a,b]. (14.29) 


Proof. Set 


Aj(x) := 


Lif (b)gi41(2, b) — Lif (a)git1(2, a)] 


Lif (y)girs(2,y) pes 
a) Grail) Pett ody) i=0,1,...,n—1. (14.30) 


From (14.22) we have 


Jinn = J t+ Ai, 7=0,1,...,n—-1. 


(14.31) 
That is 
J, = Jo + Ao, 
Jo = Jy + Ay =Jjp9+Ao+ Ai 
J3 = Jo + Ao = Jo + Ao + Ar + Ag 
n-1 
In =dn-1 + An-1 = Jo + S- Ai, 
i=0 
i.e. 
n-1 
Jn(x) = Jo(x) + > Ai(x), Va € [a,b]. (14.32) 
i=0 
Hence we established 
b b 
/ Lif (y)gn(x, y)dy = uo(2) fy) dy 
a a uo(y) 
n-1 
e ay [Lif (O)gi1(2, b) — Lif(@)gi+i(2, a)| 
i=0 


i Lif (y)giai(@,Y) 
+f Lest, onan} Vane (a, OI, (14.33) 


proving (14.28). 


At last we give the following general representation formulae. 
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Theorem 14.10. Let f, {u;}%_) € C"'!([a, b]), n > 0, x € [a,b], all Wi (x) > 0 on 
[a,b], i=0,1,...,n. Then 


fn to). f° LO ay Sin — pd Lif Oger (ed) ~ Lif (agin (, 9) 
fe) = +0555 | Zqytt Lt 1 —. 
LL fp? (fu) G10, 9) HY) 
=a) ( biti (y) ) a} 
+o eB icet ORG Oa. VLG (14.34) 


If uo(x) = c > 0, then 


= i i xv — i Q)Git+1\v,a 
je) = Ot) i fy) iy Son of Heist ,b) — Lif (a)gis1(@,4)) 
? i=0 


b-a 
: = [ (Efwauale net's) iy} 


1 


" b—a 


b 
Gn(2, t) Engi f(t) K(t,x)dt, Va € [a,b]. (14.35) 


Proof. By Theorem 14.6 and Lemma 14.7. 


Note 14.11. Clearly (14.8),(14.9),(14.34) and (14.35) generalize the Fink identity 
(see [195]). 


We have 


Corollary 14.12. Let f,{ui}™_ € C"*({a, b]), n > 0, x € [a,b], all Wi(x) > 0 on 
[a,b], =0,1,...,n. Then 


as f(z) uo(x) f? fy) 
En+1(2) = (n+ 1) — — ada ay 
Lf (Lif (@giar(x, 0) — Lif (0) 9:41 (x, 0)] 
Ca) Stn of Maeda faint 
- =a, ( bi+1(y) iy} 


1 b 
= eee i; Gn(2,t)Lngif(t)K(t,x)dt, We € [a,b]. (14.36) 
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If uo(x) = c > 0, then 


b 
1 Af Lif (@ginr(, a) — Lif )gi41 (2, 0)] 
aa Ea b-—a 
1 ff? (Lif (y)gi41 (2, 9) O41 (y) 
- eri ( di4i(y) ) a} 
1 


b 
= GEneso ‘| dn(2,t)Lmaif(t)K(t,x)dt, Ve € (a,b). (14.37) 


Proof. By Theorem 14.10. 


Note 14.13. If ¢/,,(y) =0, i =0,1,...,n—1, then E,41(x), En4i(a) simplify a 
lot, e.g. case of u;(x) = x’, i=0,1,...,n, where diyi(y) =i+1. 


Next we estimate En4i(2), En+1(2) via the following Ostrowski type inequality: 


Theorem 14.14. Let f, {ui}t%p € C”*!([a, bj), n > 0, x € [a,b], all W;(x) > 0 on 
[a,b], 2 =0,1,...,n. Then 


|En+i(2)|, 1 c 
eel ; maa | i. |gn(a, t)|(t — a)dt 


b 
Flr i gn (x, t)|(b — bat I[Entiflloo llgn(#s)Iloo 


max((x — a),(6— 2))|[Ln4if la, ll9n(2, D(a 


(14.38) 
where p,q > 1: gE ; = 1, any x € [a,b]. The last inequality (14.88) is sharp, 
namely it is attained by an f € C°* [a,b], such that 

[Entif(t)| = Algn(a, t)K(t,2)|%”, 


when [Lnsif (t)gn(2,t)K(t,x)] is of fixed sign, where A > 0, Vt € [a,b], for a fixed 
x € |[a, b). 


Proof. We have at first that 
z 1 b 
JEn+i(x)| < ainece | l9n(x,t)| |Ln4if(t)| |K(E, x) |dt 


|Zn+1f loo 
<+)b-a) 
_ |p eaeiieaw sae +P onl |gn(a, t)|(b— t)dt]. 


(n4 
(14.39) 


J tonte.oixceapla 


3 
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Next we see that 
|Lnti fll 


Enti(2)| < earn 


Finally we observe for p,q > 1 with “ + ; = 1, that 


En+ifllollgn(z, )kC, 2) lq 
(n+ 1)(b-—a) ; 


) lan(e,Ilo mar — a), (6 —)). (14.40) 
lEnsi()| < (14.41) 


The last is equality when 
Emar f(t)!” = Blgn(x,t)k(t, 2) 4, (14.42) 


where B > 0. The claim now is obvious. 


Next we present a basic general Grtiss type inequality. 


Theorem 14.15. Let f, h, {ui}%@p) € C"T!({a, b]), n > 0, all Wi(x) > 0 on [a,b], 
i=0,1,...,n. Then 


—  [ senta)ee es (= [ eae) (Ff h(x) 


~ ooo? o» [ [ow@use aM f(2)Lsh(y)) (es way) 


< spe (ff en [eile 


+ |f(2)| | Ln-+1hlloo) Ign (e, )| (t,2) aed) =: My. (14.43) 


D:= 


Proof. We have by (14.9) that 


F(z) = af sos ald [ Lifluatesu)du) 


1 b 


and 


me) = A [mney + (0 [ ercatenae) 


1 b 
ar Bea a 


Therefore it holds 


payne) = Ff payars (Do [mertasnacevdty) 


Gn(X,t)Lnyih(t)K(t,x)dt, Vax € [a,b]. (14.45) 


b 


+ ee fra) Ensrflan(e.t)K(t2)at, (14.46) 
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and 


b nnd 
Flaynta) = 5 [Medd + (Lf seoeamnatevdtn) 
b 
+ —/ f(x) Ln4ih(t)gn(ax, t)K(t,x)dt, Va € [a,b]. (14.47) 
Integrating (14.46) and (14.47) we have 


if _ UF fa)dex) (f’ h(w)dx) 
ba) : He Made (ba)? 


i een (xf [me Laflua(es duet) 


ay on f [ve \Ensif(Hogn(a, t)K (t, x)dtda, (14.48) 


roy f ten (a)de = Wa Fada) (Jy ha)da’) 


(b—a)? 


cor (Sf [ F(2)Lih(w)ai(as dude) 


b b 
(b — | i; f(x)Ln4ih(t)gn(x,t)K(t,x)dtde. (14.49) 


Consequently from (14.48) and (14.49) we derive 


wea fonts aman toe) L Mo) 
wor pom ff Laflu)an(e.u)due) 


z ae ao f fe Ese F Oe DRE Dade, (14.50) 


wal, Komen map Home) f Mom 
oat ees ff see h(a)an(e.y)due) 


7 oof a f(@)Ln4ih(t)gn(@, 1) K(t, x)dtde. (14.51) 


and 
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By adding and dividing by 2 the equalities (14.50) and (14.51) we obtain 


A= rs [ seoneres 2 - — ([ meyae) 
- eee [me (x) Lih(y Nase s dvd 


_ vere :( ii [0c emery ) + Ha)Ensih(d)onle, NK (lehdede). 
(14.52) 


Therefore we conclude that 


Al< Laff 1h()| [Ena leo 


+ |f(2)| Ln+ihlloo) Ign(, )| IK(L.a)ldtdr), (14.53) 


proving the claim. 
The related Gruss type L; result follows. 


Theorem 14.16. Let f, h, {ui}tp) € C"T!([a, b]), n > 0, all Wi(x) > 0 on [a,b], 
i=0,1,...,n. Then 


3 playnteyde ~ (>> [ nterae) (GAS fnteyar) 


reed Onn [we (x) Lih(y )ace,u)dve) 


i=1 “ @ 


du t)|o0,{a,0}2 (MPlloollLn4if lla + [flloolLn4ihll1) =: Me. (14.54) 
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Proof. We obtain 


|A| = )Lnsif (on(a, t)K (t, 2)dtdx 


2 vy io f(0)Ln4ih(t)gn (x,t) K (t, 2)dtda 


<g> {ff a |A(x)| Ln+if(t)| |gn(a, t)|dtde 
+ i i 2 tna h( anole] 
b b 
< aa ( ¢ s(t) Ill] col|gn (x, t)||00,f0,0]2 
b b 
+(f (/ aM) A) illints ona | 


= ${ lene flan ote 


t Wena lelin( hoa 


5 sllgn( sf) ta. a (WlloolZntif lla + Il fllool]Zn+1hll1), 


establishing the claim. 
The related Griiss type Ly result follows. 


Theorem 14.17. Let f, h, {ui}%p. € C™t([a, b]), n > 0, all Wi(x) > 0 on [a, bj, 
i=0,1,...,n. Let also p,q,r > 0 such that = + steel. Then 


oad Oe — (4, [ wear) 
TooaF é fee y) + fla) Li hla))as(e.y)due) 


<27*(b—a)7 Hane KE ©) {0,6}? [IlAllp,ta,elLn+1f lla,ta.b) 


F eset =: M3. (14.55) 
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Proof. We derive by generalized Holder’s inequality that 


b b 
al < ap Jf no) nar FOI ign, 9K (2) late 


b b 
F: i i | nea one) 


< sm (f [ noratar) ‘is i; [ IenssS(O)Rate) 


: Ilgn(z, ‘Kt, ©) || s,[a,8}2 


+ (ff \ncorratar) i (ff \eosantt tata) "ante 98 ene 


1 
~ 36—a)? C — a)? ||Pllp,fa,o(0 — @) || Ln f llq,ta,oill9n(@, t)K (t, 2)|lrfa,0]2 


+ (b= a)? ll fllp,ta,5)(0 — 4)" Ln+1lla,ta,t}ll9n(a, KE, eae 


b—a)i-= 
= Fe [ltlijosllenaa las 


+ [fllo.ta,o | Ln+1Fllg,ta,o1] IIGn(@ t)K (t, 2) lr,fa,0)2 > 


proving the claim. 
We conclude the Griiss type results with 
Corollary 14.18 (on Theorems 14.15, 14.16, 14.17). Let f, h, {ui}tp € 


C™*1([a,b]), n > 0, all W(x) > 0 on [a,b], i = 0,1,...,n. Let also p,q,r > 0 
such that Earns 4 =1. Then 


D < min{ My, Mo, M3}. (14.56) 


The related comparison of integral means follows, see also [48]. 


Theorem 14.19. Let f, {uj}%o € C”*!([a,b]), with uo(x) =c > 0, n > 0, all 
W(x) > 0 on [a,b], i = 0,1,...,n. Let pu be a finite measure of mass m > 0 on 
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({c, d], P([c, d])), [c,d] C [a,b] CR, where P stands for the power set. Then 


on! f(x)du(x) — te ff seas 


(n + 1)m [c,d] 
1 n—1 
+ ae iin 2 —i) 
[Lif(a) Steal gi+1(a, a)dpu(x) — Li f(b) Steal Gi41(2, b)dy(x)| 


b-a 


ra a [Eee a ate} 


b 
/ ate ni— 0a ‘ne I|Zn+iflloos 


+ 


(/ E (Ilgn (x; *)|loo max(x — a,b — x)) ini) |Entiflla, 


(/ j Ilgn(a, Dt lade) lens (14.57) 


where p,q > 1: Steal. 
Proof. By Theorem 14.14. 


Assumptions 14.20. Next we follow [140]. 

Let f be a convex function from (0,-+00) into R which is strictly convex at 1 
with f(1) = 0. Let (X,.A,,) be a measure space, where 4 is a finite or a o-finite 
measure on (X,A). And let j11, “2 be two probability measures on (X,.A) such that 


[i <A, po < X (absolutely continuous), e.g. A = (41 + Me. Denote by p = ue 


q= oe the (densities) Radon—Nikodym derivatives of 41, “2 with respect to A. 


Here we assume that 


Gee eS ae. on X anda<1<b. 
qd 
The quantity 


ryan) =f ales (B22) ance (14.58) 


was introduced by I. Csiszar in 1967, see [140], and is called f-divergence of the 
probability measures fz; and fz. By Lemma 1.1 of [140], the integral (14.58) is 
well-defined and T'y(41, 2) > 0 with equality only when 4; = pe. Furthermore 
T'(441, #2) does not depend on the choice of A. Here by assuming f(1) = 0 we can 


190 PROBABILISTIC INEQUALITIES 


consider I’ ¢(y1, 2), the f-divergence, as a measure of the difference between the 
probability measures j11, [2. 
Here we give a representation and estimates for T'f(j41, v2) via formula (14.35). 


Theorem 14.21. Let f and Ty as in Assumptions 14.20. Additionally assume 
that f, {ui}%@o € C”T ([a, b]), with uo(z) =c > 0, n> 0, all W;(t) > 0 on [a,b], 
i=0,1,...,n. Then 


Tr(u1, [2) 


= 24) Pf jypdy 


Son nf Seas Ee 


é b-a 
1=0 


b/ Li; pay p(x) aa 
A(Lol/ (Se ei ay ane) + Raa 


(14.59) 


where 


Rigs =f ( [on( Ad tannin (1-22) a) ce, (14.60) 


Proof. In (14.35), set x = e multiply it by q and integrate against A. 


The estimate for T'-(j11, U2) follows. 
Theorem 14.22. Let all as in Theorem 14.21. Then 


[Rr+1| 
< taint fam f n( A. t)||x(« He) la) axe | bens a,b) 
( [ als (2. \K (. uo) Ee Pe) Mens Mn 

[ale (2. JI. (ag 42) ava) lens Flsaaf 


(14.61) 
where pi, p2 > 1 such that on + oo =1. 
Proof. From (14.60) we obtain 


neal $ A [aol n( Bt) ener s(0)|m(t He) le) an 


= a (14.62) 
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We see that 


se A] fool 


Also we have true that 


re (fa 


Finally we derive 


In Ga t) | Kx (+ me) |) iN) ensat lle iaai: 
(14.63) 
oo Fay) (-a5) 


aC oo] ae 


b 
pla) 4 4 PY ancy 
max (2) % oe )) Mine aie (14.65) 


ive) |Entif lps [a,0)- 
p2,[a,b] 


(14.64) 
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Chapter 15 


Representations of Functions and 
Csiszar’s f-Divergence 


In this chapter we present very general Montgomery type weighted representation 
identities. We give applications to Csiszar’s f-Divergence. This treatment follows 
[62]. 


15.1 Introduction 


This chapter is motivated by Montgomery’s identity [265]: 


Theorem A. Let f : [a,b] — R be differentiable on [a,b] and f' : [a,b] — R 
integrable on [a, b]. 
Then the Montgomery identity holds 


1 P 1 ‘ * / 
f(z) = —/ f(t)dt + —/ P* (x,t) f (t)dt, 
where P*(x,t) is the Pedno Kernel defined by 


t—a 
> a, astK<g, 
pe |e x<t<b 
b-a? =r 


We are also motivated by the following weighted Montgomery type identity: 


Theorem B (see [294] Pecaric). Let w : [a,b] — R4z is some probability 
density function, that is, an integrable function satisfying f? w(t)dt = 1, and 
W(t) = f w(x)dx for t € [a,b], W(t) =0 fort <a, and W(t) =1 fort > b. 

Then 


b b 
f(x) = i. w(t) f(t)dt + / Pu (at) f'(t)at, 


where the weighted Pedno Kernel is 


Pyw(a,t) := 
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The aim of this chapter is to present a set of very general representation formulae 
to functions. They are 1- weighted and 2- weighted very diverse representations. 
At the end we give several applications to representing Csiszar’s f-Divergence. 


15.2 Main Results 


We need 


Proposition 15.1 ([32]). Let f : [a,b] — R be differentiable on [a,b]. The derivative 
f’ : [a,b] — R is integrable on [a,b]. Let g: [a,b] — R be of bounded variation and 
such that g(a) 4 g(b). Let x € [a, b]. 


We define 
Hata) 2S ES 2, 
Pw (g(x), g(t)) = (15.1) 
Hiaaay © <ES 
Then 
f(x) = aoa f 10 f (t)dg(t) +f P(g bf (dk, (15.2) 


We also need 


Theorem 15.2 ((241], p. 17). Let f € C"([a,6]), n EN, x € [a,b]. 
Then 


b 
en —/ Chee Pe One Cor (15.3) 
where form €EN we call 
m k-1 
Om(x) := S- C- dB, (=) [sD (0) Es f°-Dea)] (15.4) 
k=1 


with the convention ®o(x) = 0, and 


R,(x) = sey LB; (=) 3 (=) fM(t)dt. (15.5) 


Here B,(x), k > 0 are the Bernoulli polynomials, By = By,(0), & > 0, the 
Bernoulli numbers, and By, k > 0, are periodic functions of period one related to 


the Bernoulli polynomials as 
By(«) = B(x), O<a<1, (15.6) 
and 


Bi(a+1) = Bea), «ER. (15.7) 
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Some basic properties of Bernoulli polynomials follow (see [1], 23.1). We have 


1 1 
Bo(x) = 1, By(z) =2—5, Bo(z) = 2° —a +=, (15.8) 

and 
Bi(x) =kBr_i(x), KEN, (15.9) 
By (2 +1) — By(x) = ka*', k > 0. (15.10) 


Clearly By = 1, BY ts a discontinuous function with a jump of -1 at each integer, 
and BE, k > 2, is a continuous function. 

Notice that B,(0) = B,(1) = Br, k = 2. 

We need also the more general 
Theorem 15.3 (see [53]). Let f : [a,b] — R be such that f"-), n > 1, is a 


continuous function and f™ exists and is finite for all but a countable set of x in 
(a,b) and that f™ € L,([a,b]). Then for every x € [a,b] we have 


= hfs i) yar Oe aes br (F=*) [FP @) - f° M@) 


ae iz B, an = Be (=)| FO (t)at. (15.11) 


The sum in (15.11) when n = 1 is zero. 

If f("— is just absolutely continuous then (15.11) is valid again. 
Formula (15.11) is a generalized Euler type identity, see also [143]. 
We give 


Remark 15.4. Let : g be as in Proposition 15.1, then we have 


f(z) = ao f 1 )dg(x + f Pie x)) f'(x)dx. (15.12) 


Further assume that a be such that f("-), n > 2, is a continuous function 
and f(a) exists and is finite for all but a countable set of x in (a,b) and that 
f™ €L1([a,4)). 


Then, by Theorem 15.3, we have 


fi(x) = iO) Aa) + Ty—2(2) + Rn—i(2), (15.13) 
where 
n—2 k-1 
Tra(a) = YO, (F=2) [MO — 1 @], 5.4) 
k=1 : 
and 


Rn—1(2) - af [Bi (=) oa: er (=) f(t)dt. (15.15) 
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The sum in (15.14) when n = 2 is zero. 
If f("— is just absolutely continuous then (15.13) is valid again. 
By plugging (15.13) into (15.12) we obtain our first result. 


Theorem 15.5. Let f : [a,b] > R be such that f"-), n > 2, is a continuous 
function and f™ exists and is finite for all but a countable set of x in (a,b) and 
that f(™) € Ly({a,b]). Let g: [a,b] > R be of bounded variation and g(a) £ g(b). 
Let z € [a,b]. 


Then 
ae ae f0)- fla) f’ 
fe) = aaa | Feds) + Sa f Prate),a@)yae 
+ Ped (4) F@) [ Plo2).91)) Bs (=) da 
k=1 g 
agree FP » ra 
+ S29 | Pae). ato) ( [ (e (F) 
may Se (—)) f° (oat) de. (15.16) 
Here, 
Sasa 7S tS 2, 
P(g(2),9(t)) = (15.17) 
BoE, z<t<b. 


The sum in (15.16) is zero when n = 2. 
We need Fink’s identity. 


Theorem 15.6 (Fink, [195]). Let anya,b€R, a 4b, f: [a,b] >R,n>1, fr) 
is absolutely continuous on [a, b]. 


Then 
Pf pyar (BAK) (LP OW= ak — FE PO — HE 
fle) = 5" f sae > =) ( eae 
1 : n—-1 (n) 
may f enon KEIM Oar, (15.18) 
where 


t-—a,a<t<a<b, 
K(t, 2) = {oe eee (15.19) 


When n = 1 the sum ee, in (15.18) is zero. 
We make 
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Remark 15.7. Let : g be as in Proposition 15.1, then we have 


f(z) = aoa f ie )dg(x + f Pte x)) f' (a) da. (15.20) 


Further suppose that f’ be such that f("—),n > 2, is absolutely continuous on 
[a, 0]. 


Then 
Fi (WAL) Ho) 
n—2 ee = f(b)(a — b)* — fF (a)(a — a)F 
So 
b 
7 a, (@-O" “KE AIM wa. (15.21) 


When n = 2 the sum 5, ; in (15.21) is zero. 
By plugging (15.21) into (15.20) we obtain 


Theorem 15.8. Let f : [a,b] > R be such that f"-, n > 2, is absolutely 
continuous on [a,b]. Let g : [a,b] > R be of bounded variation and a # b, g(a) # 
g(b). Let z € [a,b]. 

Then 


1 b 


b 
P(g(z), 9(2)) (/ (x —t)” ?K(t, x) f™ iat) dx. 


(15.22) 


a 


When n = 2 the sum So , ; in (15.22) is zero. 
We need to mention 


Theorem 15.9 (Aljinovic- Pecaric, [14]). Let’s suppose f-), n > 1, is a 
continuous function of bounded variation on [a,b]. If w : [a, oh — Ry is some 
probability density function, t.e. integrable function satisfying few t)dt = 1, and 


W(t) -| w(x)dx for t € [a,b], 


W(t) =0 fort <a and W(t) =1 fort > b, the weighted Pedno Kernel is 


W(t), a<t<a, 
Pyw(a,t) = (15.23) 


W(t)-1, e<t<b. 
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Then the following identity holds 


>> va ( ’ "Pw (0) Be (= “) i) Oa 7" @) 
k=1 ¢ 


taf ( [Peto [Bs (=) = Dy (=)| i) ap (2), 


(15.24) 


We make 


Assumption 15.10. Let f : [a,b] ~ R be such that f—-), n > 1, is a continuous 
function and f(")(x) exists and is finite for all but a countable set of x in (a,b) and 
that f( € L1([a, b)). 
We give 

Comment 15.11. Let f as in Assumption 15.10. Then f(~)(x) absolutely 
continuous and thus of bounded variation. Furthermore, we can write in the last 
integral of (15.24) 

df—Y(t) = fMt)dt. (15.25) 


So, if we assume in Theorem 15.9 for f that f(—) is absolutely continuous we 
have that (15.24) and (15.25) are valid. 
We make 


Remark 15.12. Let f,g be as in Proposition 15.1 , then we have 


1 b b ; 
fle) = eae fle date) + f Plol=).a(@))F' we (15.26) 


Further assume that f be such that f"-), n > 2, is a continuous function 
and f(")(x) exists and is finite for all but a countable set of x in (a,b) and that 
f) © Ly ((a, 8). 

Then using terms of Theorem 15.9 we find 


i ( [ Pwte.s) [Bia (4) - 32 (4) is) FO (at. 
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Plugging (15.27) into (15.26) we get 


Theorem 15.13. Let f : [a,b] > R be such that f"-, n > 2, is a continuous 
function and f(x) exists and is finite for all but a countable set of x in (a,b) and 
that f(™ € Li({a, 8). 

Let g: [a,b] > R be of bounded variation and g(a) 4 g(b). 

Let w,W and Py as in Theorem 15.9, and P as in (15.1). Let z € [a,b]. Then 


(2) = ava (a)dgla + ( fr Pla(2),g(a))dex uo) ( | ‘nto soa) 
ver : (#0) f®(a)) 
aoe »( frvenmin (F=2) at) ar Sor 
[reece] f (frees [i.2 (T=) 
na 3) Pa oa 


We need 


Theorem 15.14 (sce [16], Aljinovic et al) Let f : [a,b] + R be such that f"-, 
is an absolutely continuous function on [a,b] for some n > 1. 

Ifw: [a,b] ~ Ry is some probability density function, then the following identity 
holds: 


b n—1 n-1 b 
ya = / w(t) f(t)at — S70) +O / w(t) Fe (t)dt 
a k=1 k=1° 


b 
7 acraca | (a — y)”""K(y, 2) f™ (y)dy 


b b 
S acnraca | (/ viote a'r) f™(y)dy. (15.29) 


F, (x) := eS *) (eis ee nee) (15.30) 


t-a,a<t<a2<b, 
(15.31) 


t-ba<a<t<b. 
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We make 


Remark 15.15. Same assumptions as in Theorem 15.14 and n > 2. 
Here put 


F(a) := (“—*) (Pie orden) (15.32) 


Thus by (15.29) we obtain 


b b 
~ pea | ( i w(t)(t — nax(y.0a) f (y)dy. 


(15.33) 


Let g : [a,b] — R be of bounded variation and a # b, g(a) 4 g(b). 
Then, by Proposition 15.1, it holds (15.26). 
By plugging in (15.33) into (15.26) we obtain 


Theorem 15.16. Let f : [a,b] > R be such that f- is an absolutely continuous 
function on [a,b] for n > 2. Here w : [a,b] > Rx is some probability density 
function, and z € |a, J. 

Let g: [a,b] > R be of bounded variation with a # b, g(a) # g(b). Then 


1 b b 
f@)= ay : f(a)dg(x) + (a [ Po ae) (fe wert) 
n—-2 ab b n—-2 ab 
- P(g(2),9(2))Fe(ade + (| f P(o(2),9(@))ae w(2)F,(x)dx 
d. .) g(2),g k ( i. g(2),g - (s i 


b b 
+oaece | Pioeno@) ( / WE (a.2 Fahy da 
1 b 
atl Palate) 
b b 
| | ( ,, vio(t a"? ba) f(y)dy. (15.34) 


Here F(x) is as in (15.32). 


We mention 
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Background 15.17 (see {144]). Harmonic representation formulae. Assume that 
(P(t), & > 0) is a harmonic sequence of polynomials i.e. the sequence of polyno- 
mials satisfying 


Pea Pian); kee Pod) a4. (15.35) 
Define P;(t), k > 0, to be a periodic function of period 1, related to P(t), k > 0, 
as 
Pelt) = Pelt), O<t<1, 
Py(t+1)= Pit), teER. (15.36) 
Thus, Pj (t) = 1, while for k > 1, Pf(£) is continuous on R \ Z and has a jump of 
ay = P,,(0) — Pi (1) (15.37) 


at every integer t, whenever a; # 0. 
Note that a; = —1, since P\(t) = t+, for some cé R. 
Also, note that from the definition it follows 


Pi(t)=Pei(t), k>1,teER\Z. (15.38) 
Let f : [a,b] = R be such that f("-) is a continuous function of bounded 


variation on [a,b] for some n > 1. 
In [144] the following two identities have been proved: 


ee be 2 x 
f(x) = — | f(t)dt + Tr(z) + Tr(a) + Rh (a) (15.39) 
and 
| ieee ie 5 e 
f(x) = —/ f(t)dt + Ty_1(x) + tm(x) + R2 (2), (15.40) 
where 
To d(b —a)*"' Py (F — “) f°) — f° %@), (15.41) 
for 1<m<n, and 
Tm(#) := oe a)*ta, f—-) (x), (15.42) 
k=2 
with convention To(a) = 0, 71(x) = 0, while 
P -— —(bh—a)"— {rt (n-1) 
RG@=o6-ae iz P* (FE *) df(t) (15.43) 


and 


R(x) = -(b- otf Es (=) ae (=) df(t). (15.44) 


The last two integrals are of Riemann- Stieltjes type. 
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We make 


Remark 15.18. Let f : [a,b] > R be such that f("~ is a continuous function of 
bounded variation on [a,b] for some n > 2. 
Then we apply (15.40) for f’ and for n — 1 to obtain 


f(a) = Koo) + Ty—2(z) + Ta-1(z) + R2_, (2), (15.45) 
where 
Iya(0) = (0-1, (F =) (4) - f(@)), (15.46) 
k=1 
T-1(2) := sr (b — a)*-loy, f) (x) (15.47) 
k=2 
with convention 
To(x) =0, m1(x) =0, (15.48) 
while 
[2 v) = —(b—a)"-2 * z—t _ ; va (n—1) 
Fa) = 0-0)" f | Pha (FEE) - Pea (FS) we. 


Let g : [a,b] — R be of bounded variation with a 4 b, g(a) 4 g(b). 
Then by plugging (15.49) into (15.26) we obtain 


Theorem 15.19. Let f : [a,b] > R be such that f("-) is a continuous function 
of bounded variation on [a,b] for some n > 2. Let g : [a,b] — R be of bounded 
variation with a # b, g(a) # g(b). The rest of the terms as in Background 15.17 
and Remark 15.18. Let z € [a, 6). 


Then 
Ss : , x)dg(x z), g(a) )dx Oe) 
Ie eae ue (/ P(g(z), g(a))d ( aR ) 
a sb art (4) f(a) P(g(2), g(@)) Pr (= *) q 
k=1 7 
n-1 b b 
+6 alan [PGl2) fe) F (wide + 6- ay"? [” PEG(2),9(0) 
k=2 ie ° 
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If f™ exists and is integrable on [a,b], then in the last integral of (15.50) one 
can write 


a a reag de, 


We need 


Background 15.20 (see [12]). General Weighted Euler harmonic identities. For 
a,b € R, a < 6b, let w : [a,b] — [0,co) be a probability density function i.e. 
integrable function satisfying 


[wae =i, (15.51) 
For n > 1 and t € [a, 5] let 
1 ; n-1 
Wn(t) = aa (t — 8)" *w(s)ds. (15.52) 
Also, let 
wo(t) = w(t), t € [a,b]. (15.53) 


It is well known that w,, is equal to the n- th indefinite integral of w, being equal 
to zero at a, i.e. wh (t) = w(t) and wp(a) = 0, for every n > 1. 

A sequence of functions H,, : [a,b] > R, n > 0, is called w- harmonic sequence 
of functions on [a, }] if 


Hi (t) = Hn-i(t), t#1; Ho(t) = w(t), te [a, 9]. (15.54) 
The sequence (w,(t), t 4 0) is an example of w- harmonic sequence of functions 
on [a, 6]. 
Assume that (H,,(t), n > 0) is a w- harmonic sequence of functions on {a, 6]. 
Define H*(t), for n > 0, to be a periodic function of period 1, related to H,,(t) 


as 
Gy SO) eee (15.55) 
(b—a)” 
He(t+1)=H%(t), te R. (15.56) 


Thus, for n > 1, H}(t) is continuous on R \ Z and has a jump of 


H,,(a) — Hy,(b) 


= 15.57 
B: (6—a)r ( ) 
at every t € Z, whenever {,, # 0. 


Note that 


= (15.58) 
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since 
Hi Sinoeer i Waite: (15.59) 
for some c € R. 
Also, note that 
Hi) SH af), nel, teR YZ. (15.60) 


Let f : [a,b] — R be such that f("-!) exists on [a,b] for some n > 1. For every 
x € [a,b] and 1 <_m <n we introduce the following notation 


a) = S7 Halex) [FO-Y(0) - FY, @] +O [Hi(a) — Hi(B)] f° (2), 
k=1 


(15.61) 
with convention 5)(x) = Hi (x) [f(6) — f(a). 


Theorem 15.21 (see [12]). Let (H;, k > 0) be w- harmonic sequence of functions 
on [a,b] and f : [a,b] > R such that f"-) is a continuous function of bounded 
variation on [a,b] for some n> 1. 

Then for every x € [a,b] 


n= fro f(t)W,(t)dt + S,(x) + Rp (2), (15.62) 
where 
1 -— _(h — gq)” »{z—t (n—1) 
Ne ee eee Vee H* & *) df" (1), (15.63) 
with 


(15.64) 


Also we have 


Theorem 15.22 (see [12]). Let (H;, k > 0) be w- harmonic sequence of functions 
on [a,b] and f : [a,b] > R such that f"-) is a continuous function of bounded 
variation on [a,b] for some n> 1. 

Then for every x € [a,b] andn > 2 


a= ff F(t)W,(t)dt + Sp_a(a) + [Hp(a) — Hy(b)] f(x) + R2(@), (15.65) 


while for n = 1 


f(z) = | f(t)We(t)dt + Ri (2), (15.66) 


Rig] 6a is |i; ( = -) a ae df(t), (15.67) 
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forn>1. 


Comment 15.23 (see {12]). In the case when y : [a,b] — R is such that y’ 
exists and is integrable on [a, b], then the Riemann- Stieltjes integral Sia. g(t)dp(t) 


is equal to the Riemann integral fe g(t)’ (t)dt. 
Therefore, if f : [a,b] > R is such a f™ exists and is integrable on [a,b], for 
some n > 1, then Theorem 15.21 and Theorem 15.22 hold with 


R(x) = —(b- a" | H* & -) f(t)dt, (15.68) 


[a,b] 


R2(z) := —(b— a)” iz i; (=) z a FO (t)dt. (15.69) 
We make 


and 


Remark 15.24. We use the assumptions of Theorem 15.22, n > 2. We apply 
first (15.65) for f’ and n— 1 to get for n > 3, 


y= f f' (Q)We(t)dt + Sn—a(a) + [Hn—1(a) — Hn—1(0)] f°) (a) + R2_, (2), 
(15.70) 
where 
5,o() = > Hel) (40) — f(a) + > (Ha(a) — Hi(b)) fe), (18.71) 
k=1 k=2 
and 
S1(@) = Hi(x)(f"(0) — f(a), (15.72) 


while for n = 2, by (15.66), we obtain 


y= f row (t)dt + R? (2), (15.73) 


here, by (15.67), we have 


#2_,(c) =-(b- a)" i cs ( = -) _ a] df"-D(t), (15.74) 


a)—! 


for n > 2. 
Plugging (15.70), (15.73) into (15.26) we derive 


Theorem 15.25. Let (H,, k > 0) be w- harmonic sequence of functions on [a, b] 
and f : [a,b] + R such that f°-) is a continuous function of bounded variation 
on [a,b] for some n > 2. 

Let g : [a,b] — R be of bounded variation on [a,b] such that g(a) # g(b). Here 
W,, is as in (15.64). Let z € [a,b]. 
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Ifn > 3, then 


+(b-a)""! [ P(g(z),9(2)) ( i ote _ A, (=)) a) de. 


If n = 2, then 


1 b 
(2) = ee | Hle)dale) + 


b b 
acon) ( / rowetoat) ts 


+(b- a) a P(g(z), 9(2)) (., es — Ay (=)) w() dx. (15.76) 


If f™ exists and is integrable on [a,b], then in the last integral of (15.75) one 
can write df°-)(t) = f(™ (t)dt. 

Also if f” exists and is integrable on [a,b], then in the last integral of (15.76) 
one can write df'(t) = f’(#)dt. 

We mention 


Theorem 15.26 (see [36]). Let f : [a,b] > R be n- times differentiable on [a,b], n € 
N. The n-th derivative f™ : [a,b] > R is integrable on [a,b]. Let g : [a,b] = R 
be of bounded variation on [a,b] and g(a) 4 g(b). Let t € [a,b]. The kernel P is 


defined as in Proposition 15.1. 
b 
/ f(s; )dg(s1) 
k 


Then, we find 
1 b 1 
19 = aaa | MOH + Tea 
b b 
/ 7 | P(g(t),9(s1)) [][ Pa), 9(si41))dsidsz ++ dSp41 
- 7 feat 


n—2 


k=0 


(k+1)th—integral 
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b b n-1 
+f af P(g(t), 9(s1)) [] Pols), (sis) (on )ds1 ++ d8n. (15.77) 


We make 
Remark 15.27. —All assumptions as in Theorem 15.26. Let w(t) continuous 
function on [a,b] such that cfs w(t)g(t)dt = 1. 

Then, by (15.77) we obtain 


n—-2 b 
fu(t) = (s1)dy(s) +$ = — » ( | Hay) 
b b k 
Pf Palts.s(ov) TL P(e). a6sier))dsidsa+- ds 


i=l 
(k+1)th—integral 


n-1 


b b 
@ | | P(g(t), g(s1)) LT Plo(s.), 9(si41)) FO (6n)ds1 +++ dsm, (15.78) 


Hence 


b (+1) (5 )dg(s1 
| sowtoastt) = spo | f(er)da(er) +S ee 


b b b k 
, / w(t) ( i 4 / mee ate dg(t) 


b b b 
+f w() ( fof Pla) a) TT Plaats) Hse) dt). 


(15.79) 
Next we subtract (15.79) from, by Theorem 15.26 , original formula 


aol f(s1)dg(s1) aa ([ f era) ( (sa)) 
b b k 
= ; vs | P(g(z), 9(s1)) T] P(a(si),9(si+1))dsrds ce] 


b b n-1 
+ fo fF Pll2),9(61)) T] Plates), olsen) (on)dsi-+-dsp, V2 € [a8 
i (15.80) 
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where 
g(t)—9(a) 
aga: ¢ StS 
P(g(z), g(t) = ie (15.81) 
g(t)—g(b 
gagay? 2 <ES 8. 
We derive 
Sf. f*) (81)dg(s1) 
Hey f Howeast +E 


b b b k 
| i w(t) i is / sdiniptauiniess dg(t) 


i=1 


i=1 


b b b k 
= / w(t) ( / rc / P(a(t),9(s1)) |] P(a(si), a(se41))dsidsa-- ce] dg(t)] 


b b b n-1 
+f w() fof Plo()ate) T] Pets otvad Aone Ag dg(t) 


i=1 


b b b n-1 
- | w() ( fof Pla) a) TT Plaats) Hse) dg). 


(15.82) 
We have proved the general 2- weighted Montgomery representation formula. 
Theorem 15.28. Let f : 


[a,b] — R be n- times differentiable on [a,b], n € N. 
The n-th derivative f(™ 


: [a,b] > R is integrable on [a,b]. Let g : [a,b] > R be of 
bounded variation on [a,b] and g(a) 4 g(b). Let z € [a,b]. The kernel P is defined 
as in (15.1). Let further w € C([a,6]) such that f? w(t)g(t) =; 

Then 


r= [row > HM eneate) ee 
i Pa [P(g(z), 9(81)) — P(g(t), 9(s1))] 
Piso g(si+1))ds1ds9-+-dsp41dg(t) 
+ [i [we Pate).9(61) ~ Plol0, 900) 
Tl P(9(si), 9(8i+1)) f (Sn)ds1 ++ dsndg(t). (15.83) 


i=1 
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Notice that the difference [P(g(z), g(s1)) — P(g(t), g(s1))] takes on only the val- 
ues 0, +1. 

We apply above representations to Csiszar’s f-Divergence. 

For earlier related work see [43], [101], [103]. 

We need 


Background 15.29. For the rest of the chapter we use the following. Let be a 
convex function from (0,00) into R which is strictly convex at 1 with f(1) =0. Let 
(X,A,) be a measure space, where X is a finite or a o-finite measure on (1’,.A). 
And let j11, 42 be two probability measures on (¥,.A) such that ur << A, wa K A 
(absolutely continuous), e.g. A = ti + pa. 

Denote by p = fun q= sie the (densities) Radon-Nikodym derivatives of j11, 2 
with respect to A. 

Here we assume that 


0<a<e <b, ae.onX anda<1<b. (15.84) 
qd 
The quantity 
x 
Tylana) = fate) (Bh) arc (15.85) 
x q(x) 


was introduced by I. Csiszar in 1967, see [140], and is called f-divergence of the 
probability measures 41 and ~2. By Lemma 1.1 of [140] the integral (15.85) is 
well-defined and T'y(f11, w2) > 0 with equality only when py = fe. Furthermore 
T'(t1, U2) does not depend on the choice of A. 

Here by assuming f(1) = 0 we can consider T'y(j11, ju2), the f-divergence as a 
measure of the difference between the probability measures j11, Mo. 

Furthermore we consider here f € C”([a,b]), n € N, g € C({a,b]) of bounded 
variation, a £ b, g(a) 4 g(b). 

We make 


Remark 15.30. All as in Background 15.29 , n > 2. By (15.16) we find 


(5) = aorta f Poorer (6() at) ) (AS) 


(/ ee (=) — Bhs (=)| oat da. (15.86) 


By integrating (15.86) we obtain 
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Theorem 15.31. All as in Background 15.29 , n> 2. 
Then 


f(a, [2) moe ff )dg(x 


a wer (Lot(3}»0))3 


Bac : 
(LoL (2) (Le G2) 
£5 (=)) f(°"(bde) a) ax) (15.87) 


Using (15.22) we obtain 
Theorem 15.32. All as in Background 15.29, n > 2. 
Then 


V5 (41, 2) 


; (< 7 or 2Kb9 FDA) i) a) (15.88) 


When n = 2 the sum peda in (15.88) is zero. 
Using (15.28) we get 


Theorem 15.33. All as in Background 15.29, n > 2. 
Here Pw as in Theorem 16.9. 
Then 


f (la, [12) af f(a )dg(x 


(/ (/ Pw(z, s)[B*_» (=) ss Bites (=2)ie) (pda) dd). 


(15.89) 
Using (15.34) we obtain 
Theorem 15.34. All as in Background 15.29, n > 2. Here w : [a,b] > Rx is a 


probability density function, and F', is as in (15.32). 
Then 


ls ‘ie [12) 


on ations 

-¥ (La(f (02) 2) Fee 

+ (EL meron) ({a( fr (6(8) 0) «) 9) 
saab (Kel fr@) 0) 

, ( iy lee yr *(y.2)f% a) is) a) 

- waaay (Le Lo) #0) ) 9) 

; ( | ( i “wll rua yyy) (15.90) 
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Using (15.50) we find 
Theorem 15.35. All as in Background 15.29, n > 2. The other terms here are as 
in Background 15.17 and Remark 15.18. 

Then 


T's (e1, 12) 


-aoa | Fore 
+(aet) ({4(f°»((8) 0) 


+ D— ay" (F() — F(a) 


man (fal f(s (2) ate) !) 1 Ar) ar) 6-0) 
i i r(o(2) 9) (Ll GS) 
— Pia (F=*)] oar) ae) ar), 


Using (15.75) and (15.76) we obtain 


(15.91) 


ae 


Theorem 15.36. All as in Background 15.29, n > 2. Let (Hy, k > 0) be w- 
harmonic sequence of functions on [a,b]. Here W, is as in (15.64). 
Ifn > 3, then 


T(t, 2) 
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(, 5 (<= a oe An-1 (=)) Hoa i) a) (15.92) 


Ifn = 2, then 


i e _ Ht (=)) ret) i) a) (15.93) 


Using (15.83) we obtain 


Theorem 15.37. All as in Background 15.29. Let w € C({a,b]) such that 


J? w(t)g(t)dt = 1. 
Then 


fo fF) (s1)dg(s1) 
f (Ha, f2) -[ f(t) Cara 


Ue " Hs (o(P/a),9(81)) ~ Pla(t), (61) 


[tas g(sit1))ds1 +++ dspyidg(t 4 


mie [ _[ woPacria).s(o0) ~ Plot a6) 


TI Pto(s 9(si41)) f (sn)dsi -- we ah (15.94) 


Note 15.38. All integrals in Theorems 15.31 - 15.37 are well-defined. 
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Chapter 16 


About General Moment Theory 


In this chapter we describe the main moment problems and their solution methods 
from theoretical to applied. In particular we present the standard moment problem, 
the convex moment problem, and the infinite many conditions moment problem. 
Moment theory has a lot of important and interesting applications in many sciences 
and subjects, for a detailed list please see Section 16.4. This treatment follows [28]. 


16.1 The Standard Moment Problem 


Let g1,.--;gn and h be given real-valued Borel measurable functions on a fixed 
measurable space X := (X,A). We would like to find the best upper and lower 
bound on 


lh) = J neater 
given that ju is a probability measure on X with prescribed moments 
folmlat) =i. =A, em 
Here we assume yz such that 
f ssln(at) <+00, ¢=1,...50 
x 
and 
| |h| (dt) < +00. 
xX 
For each y := (y1,---,Yn) € R”, consider the optimal quantities 
L(y) := L(y | h) == inf u(A), 
U(y) = U(y| h) == sup u(h), 
U 
where py is a probability measure as above with 
Mg:)=Yi, t=1,...,n. 
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If there is not such probability measure jz we set L(y) := +00, U(y) := —oo. 

If h := ¥g the characteristic function of a given measurable set S of X, then we 
agree to write 

L(y | ¥s) = Ls(y), U(y | Xs) = Us(y). 

Hence, Ls(y) < pu(S) < Us(y). Consider g : X — R” such that g(t) := 
(gi(t),.--,9n(t)). Set also go(t) := 1, allt € X. Here we basically describe Kemper- 
man’s (1968)([229]) geometric methods for solving the above main moment problems 
which were related to and motivated by Markov (1884)([258]), Riesz (1911)([308]) 
and Krein (1951) ({238]). The advantage of the geometric method is that many times 
is simple and immediate giving us the optimal quantities L, U in a closed-numerical 
form, on the top of this is very elegant. Here the o-field A contains all subsets of 
Xx. 

The next result comes from Richter (1957)([307]), Rogosinsky (1958)((25]) and 
Mulholland and Rogers (1958) ({269]). 


Theorem 16.1. Let f1,..., fn be given real-valued Borel measurable functions 
on a measurable space Q (such as gi,..-,gn and h on X). Let ys be a probability 
measure on Q such that each f; is integrable with respect to js. Then there exists a 
probability measure yp’ of finite support on Q (i.e., having non-zero mass only at a 
finite number of points) satisfying 


f sloutaey = f peen'cae), 
Q Q 
alli=1,...,N. 


One can even achieve that the support of y' has at most N +1 points. So from 
now on we can consider only about finite supported probability measures. 


Set 
V := conv g(X), 


(conv stands for convex hull) where g(X) := {z € R" : z= g(t) for some t € X} is 
a curve in R” (if X = [a,b] C R or if X = [a,b] x [c,d] C R?). 

Let S Cc X, and let M+(S) denote the set of all probability measures on X whose 
support is finite and contained in S. 


The next results come from Kemperman (1968) ({229]). 


Lemma 16.2. Given y © R", theny €V iff du € M*(X) such that 
wa) = (i-ew(a) =f g(t)alat) = os = 1-47). 
Hence L(y | h) < +00 iffy € V (see that by Theorem 16.1 
L(y | h) =inf{u(h) : we M*(X), w(g) = y} 


and 


U(y | h) = sup{u(h) : w € M*(X), u(g) = y}). 
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Easily one can see that 
L(y) == L(y | h) 
is a convex function on V, i.e. 
L(Ay! + (1 = A)y") < AL(y’) + (= ALY"), 
whenever0 <A <1 andy',y” € V. Also U(y) := U(y | h) = -Lly | —A) is a 
concave function on V. 


One can also prove that the following three properties are equivalent: 


(i) int(V) := interior of V F ¢; 
(ii) g(X) is not the subset of any hyperplane in R”; 
(iii) 1,91, g2,---;9n are linearly independent on X. 


From now on we suppose that 1, g1,..., gn are linearly independent, i.e. int(V) 4 ¢. 
Let D* denote the set of all (n + 1)-tuples of real numbers d* := (do, di, ..., dn) 
satisfying 


h(t) >do+ > digi(t), all te xX. (16.1) 
i=1 
Theorem 16.3. For each y € int(V) we have that 


n 


i=1 

Given that L(y | h) > —oo, the supremum in (16.2) is even assumed by some 
d* € D*. If L(y | h) is finite in int(V) then for almost all y € int(V) the supremum 
in (16.2) is assumed by a unique d* € D*. Thus L(y | h) < +00 in int(V) iff 
D* # ¢. Note that y := (y1,.--,Yn) € int(V) C R” iff do + o_, diyi > 0 for 
each choice of the real constants d; not all zero such that do + d>¥_, digi(t) > 0, all 
te xX. (The last statement comes from Karlin and Shapley (1953), p. 5([222]) and 
Kemperman (1965), p. 578 ,([228]).) 


If h is bounded then D* F 4, trivially. 
Theorem 16.4. Let d* € D* be fixed and put 


B(d*) := {: = g(t): do + San =h(t),te x| (16.3) 


Then for each point 
y € conv B(d*) (16.4) 
the quantity L(y | h) is found as follows. Set 


y = >_ p39(t;) 
j=l 
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with 
g(t;) € B(d*), 
and 
p; = 9, Sop; =1. (16.5) 
j=l 
Then 
L(y | h) = S- pgh(t;) = do + S— diyi- (16.6) 
j=l i=1 


Theorem 16.5. Let y € int(V) be fixed. Then the following are equivalent : 


(i) Ju € M*(X) such that u(g) = y and p(h) = L(y | h), i.e. infimum is attained. 
(ti) Ad* € D* satisfying (16.4). 


Furthermore for almost all y € int(V) there exists at most one d* € D* satisfying 
(16.4). 


In many situations the above infimum is not attained so that Theorem 16.4 is 
not applicable. The next theorem has more applications. For that put 


n(z) = Tia EL) :t € X,|g(t) — z| < of. (16.7) 
Ife =>0 and d* € D*, define 
C.(d*) := {2€ a} :0<nl2)- Sain ze}, (20 = 1) (16.8) 
i=0 
and 


G(d*) := q conv Cyn (d*). (16.9) 
N=1 


It is easily proved that C.(d*) and G(d*) are closed, furthermore B(d*) C Co(d*) C 
C.(d*), where B(d*) is defined by (16.3). 


Theorem 16.6. Let y € int(V) be fired. 
(i) Let d* € D* be such that y € G(d*). Then 
Ly | h) = do + diyi +--+ + dnyn. (16.10) 
(ii) Suppose that g is bounded. Then there exists d* € D* satisfying 
y € conv Co(d*) C G(d*) 
and 


L(y | hk) = do t+ diy +--+ + dnyn- (16.11) 
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(iii) We further obtain, whether or not g is bounded, that for almost all y € int(V) 
there exists at most one d* € D* satisfying y € G(d*). 


The above results suggest the following practical simple geometric methods for 
finding L(y | h) and U(y | h), see Kemperman (1968),([229]). 


I. The Method of Optimal Distance 
Put 
M := conviex(gi(t),-.--, gn(t), R(t). 


Then L(y | h) is equal to the smallest distance between (y1,..-,Yn,0) and 
(y1,---;Yn,2) € M. Also U(y | h) is equal to the largest distance between 
(Y1,---5Yn,0) and (y1,-.-,Yn,2%) € M. Here M stands for the closure of M. In 
particular we see that L(y | h) = inf{yn4i: (y1,---.Yn; Yn+i) € M} and 


U(y | bh) = sup{ynsi : (Y1,---5Yns Yngi) © M}. (16.12) 


Example 16.7. Let 4 denote probability measures on [0,a], a > 0. Fix0<d<a. 
Find 


L:= int [ tu(dt) and U:= sup t? (dt) 
H J{0,a] bw J[0,a] 


subject to 


| tu(dt) = d. 
[0,a] 


So consider the graph G := {(t,t?) :0 <t <a}. Call M := conv G = conv G. 

A direct application of the optimal distance method here gives us L = d? (an 
optimal measure p is supported at d with mass 1), and U = da (an optimal measure 
u here is supported at 0 and a with masses (1 — 4) and ¢, respectively). 


II. The Method of Optimal Ratio 
We would like to find 

Ls(y) == inf p(S) 
and 

Us(y) := sup H(S), 
over all probability measures jz such that 

(gi) = yi, F= 1.0.50. 

Set S’ := X — S. Call Wg := convg(S), Wg := convg(S’) and W := convg(X), 


where g := (g1,---;9n): 
Finding L(y) 
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1) Pick a boundary point z of W and “draw” through z a hyperplane H of support 
to W. 

2) Determine the hyperplane H’ parallel to H which supports Wg: as well as 
possible, and on the same side as H supports W. 

3) Denote 


Ag:=WoOH=Ws04H and 
Ba:= Ws NH’. 


Given that H’ # H, set Gq := tonv(Aq U Ba). Then we have that 


Ls(y)=——, (16.13) 


for each y € int(V) such that y € Ga. Here, A(y) is the distance from y to H’ and 
A is the distance between the distinct parallel hyperplanes H, H’. 
Finding Us(y) (Note that Us(y) = 1— Lg-(y).) 


1) Pick a boundary point z of Ws and “draw” through z a hyperplane H of support 
to Wg. Set Ag:=Ws NA. 

2) Determine the hyperplane H’ parallel to H which supports g(X) and hence W 
as well as possible, and on the same side as H supports Wg. We are interested 
only in H’ 4 H in which case H is between H’ and Ws. 

3) Set Ba: = WO A’ =Ws NH’. Let Ga as above. Then 


A 
Us(y) = EF for each y € int(V), where y € Ga, (16.14) 


assuming that H and H’ are distinct. Here, A(y) and A are defined as above. 


Examples here of calculating Ls(y) and Us(y) tend to be more involved and 
complicated, however the applications are many. 


16.2 The Convex Moment Problem 


Definition 16.8. Let s > 1 be a fixed natural number and let xo € FR be fixed. By 
ms(%o) we denote the set of probability measures ys on R such that the associated 
cumulative distribution function F possesses an (s — 1)th derivative F—))(2) over 
(a, +00) and furthermore (—1)§F&~)(a) is convex in (29, +00). 


Description of the Problem. Let g;, i = 1,...,n; h are Borel measurable 
functions from R into itself. These are supposed to be locally integrable on [a9, +00) 
relative to Lebesque measure. Consider  € ms(20), s > 1 such that 


wllail) = ff lailtladt) <+00, $= 1)... (16.15) 
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and 
p([h]) = e |h(t)|u(dt) < +00. (16.16) 
Let c:= (c1,.--,€n) € R” be such that 
M(gi) = ci, t=1,...,n, pe © mMs(ao). (16.17) 
We would like to find L(c) := inf, u(h) and 
U(c) := sup p(h), (16.18) 
“ 


where ps is as above described. 
Here, the method will be to transform the above convex moment problem into 
an ordinary one handled by Section 16.1, see Kemperman (1971)([230]). 


Definition 16.9. Consider here another copy of (R, B); B is the Borel o-field, and 
further a given function P(y, A) on Rx B. 

Suppose that for each fixed y € R, P(y,-) is a probability measure on R and for 
each fixed A € B, P(-, A) is a Borel-measurable real-valued function on R. We call 
Pa Markov Kernel. For each probability measure v on R, let w := Tv denote the 
probability measure on R given by 


wl) = (tr\(A) = | P(w.Ayo(ds). 


T is called a Markov transformation. 


In particular: Define the kernel 
se if m<axu<u, 


K,(u, x) = (16.19) 


0, elsewhere . 


Notice K,(u,x) >0 and J, K.(u,x)dx = 1, all u > xo. Let 6, be the unit (Dirac) 
measure at u. Define 


PoltiyA) a= ee ae. H eee oy) 
Then 
(Tv)(A) := [Peles AoC ay (16.21) 


is a Markov transformation. 


Theorem 16.10. Let x9 € R and natural number s > 1 be fixed. Then the Markov 
transformation (16.21) uw = Tv defines an (1-1) correspondence between the set m* 
of all probability measures v on R and the set ms(xo) of all probability measures 
on Ras in Definition 16.8. In fact T is a homeomorphism given that m* and 
ms(%o) are endowed with the weak* -topology. 


Let 6: R— R be a bounded and continuous function. Introducing 


6 (u) = (Toya) = foe) - Palade, (16.22) 
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[ou = [ow (16.23) 


Here $* is a bounded and continuous function from R into itself. 
We find that 


ae t‘ o((1—t)u+tao)st®1dt, if u> xo. (16.24) 


then 


In particular 
1 


F(u~ 20)°6"(u) = Gap [ey teleyaer (16.25) 


Especially, if r > —1 we get for d(u) := (u— 20)" that d*(u) = ("t) Nu — 20)", 
for all u > xo. Here r!:=1-2---r and ("**) := tla Pala 


Ss: 


Solving the Convex Moment Problem. Let T be the Markov transformation 
(16.21) as described above. For each 4 € ms(a%o9) corresponds exactly one v € m* 
such that p= Tv. Call gf := Tgi,i=1,...,n and h* := Th. We have 


[ scan= ova 
R R 
[ww =f hdw. 
R R 


vigi) = | gtdv =e t=1,...,n. (16.26) 
R 


From (16.15), (16.16) we get find 


and 


Notice that we obtain 


J Tlaldv < +00, en ee 
R 


and 

i T\h|dv < +00. (16.27) 

R 
Since T is a positive linear operator we obtain |Tg;i| < T\g;|, 1 = 1,...,n and 
|Th| < T\hl, ie. 
7 lg; |dv <+o0, i=1,...,n 
R 

and 


| |h*|dv < +00. 
R 


That is g7, h* are v-integrable. 
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Finally 
L(c) = inf v(h*) (16.28) 
and 
U(c) = sup v(h*), (16.29) 
where v € m* (probability measure on R) such that (16.26) and (16.27) are true. 


Thus the convex moment problem is solved as a standard moment problem, 
Section 16.1. 


Remark 16.11. Here we restrict our probability measures on [0,-++co) and we 
consider the case 79 = 0. That is » € m,(0), s > 1, ie. (—1)8F@—-)(z) is convex 
for all « > 0 but u({0}) = v({0}) can be positive, v € m*. We have 


d*(u) = su-*- | (u—2)*-!. d(x) - dr, u>0. (16.30) 
0 
Further ¢*(0) = 6(0), (¢* = T¢). Especially if 
r+ts\ 
o(x) = x" then ¢*(u) = ( ) -u", (r > 0). (16.31) 
Ss 
Hence the moment 
-++oo 
Oy =| x" u(dx) (16.32) 
0 
is also expressed as 
-1 
Op = ee ‘) Bry (16.33) 
s 
where 
+00 
Br =| u"v(du). (16.34) 
0 


Recall that Ty = yu, where v can be any probability measure on [0, +00). 


Remark 16.12. Here we restrict our probability measures on [0,}], b > 0 and 
again we consider the case 79 = 0. Let yp € m,(0) and 


| x" u(dx) := ar, (16.35) 
[0,6] 


where s > 1, r > 0 are fixed. 
Also let v be a probability measure on [0,6] unrestricted, ic. v € m*. Then 
Br = ("T*)ay, where 


Ss 


a "y(du). ; 
B he u’v(du) (16.36) 
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Let h: [0,6] > R+ be an integrable function with respect to Lebesque measure. 
Consider 4 € ms(0) such that 


‘| hdw < +00. (16.37) 
[0,0] 


i h*dv < +00, vEm*. (16.38) 
[0,6] 


| hdu = | h* dv. 
[0,b] [0,0] 


Letting a, be free, we have that the set of all possible (a,, u(h)) = (u(2"), u(h)) 
coincides with the set of all 


Ge = a0") = (ea ; Hw) )) 


where yi as in (16.37) and v as in (16.38), both probability measures on [0, b]. Hence, 
the set of all possible pairs (G,, u(h)) = (G-,v(h*)) is precisely the convex hull of 
the curve 


Here h* = Th, w= Tv and 


T := {(u", h*(u)) :0<u< 5}. (16.39) 


In order one to determine L(a,) the infimum of all (hk), where py is as in (16.35) 
and (16.37), one must determine the lowest point in this convex hull which is on the 
vertical through (,,0). For U(a,) the supremum of all y(h), 4 as above, one must 
determine the highest point of above convex hull which is on the vertical through 


(8,0). 


For more on the above see again, Section 16.1. 


16.3 Infinite Many Conditions Moment Problem 


For that please see also Kemperman (1983), ((233]). 


Definition 16.13. A finite non-negative measure ~ on a compact and Hausdorff 
space S' is said to be inner regular when 


p(B) = sup{pu(K) : & C B; kK compact} (16.40) 
holds for each Borel subset B of S. 


Theorem 16.14 (Kemperman, 1983,([233])). Let S be a compactHausdorff topo- 
logical space and a; : S > R (i € I) continuous functions (I is an index set of 
arbitrary cardinality), also let a; (« € I) be an associated set of real constants. Call 
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Mo(S) the set of finite non-negative inner regular measures 1 on S which satisfy 
the moment conditions 


pay) = [assmtas) <a allie l. (16.41) 


Also consider the function b: S — R which is continuous and assume that there 
exist numbers d; > 0 (i € I), all but finitely many equal to zero, and further a 
number q > 0 such that 


1< S° djai(s) — qb(s) all s € S. (16.42) 
ier 


Finally assume that Mo(S) #0 and put 
Uo(b) = sup{u() : w € Mo(S)}. (16.43) 
= [,0(s u(ds)). Then 


Uo(b) = inf es ca; | cy > 0; 


ier 


ae cia;(s) all s € st. (16.44) 


ie 


here all but finitely many c;; i € I are equal to zero. Moreover, Uo(b) is finite and 
the above supremum is assumed. 


Note 16.15. In general we have: let S be a fixed measurable space such that each 
1-point set {s} is measurable. Further let Mo(S) denote a fixed non-empty set of 
finite non-negative measures on S. 

For f : S — Ra measurable function we denote 


Lo(f) = L(f, Mo(S)) := 


inf { [Flom f(s > he Mo(5)} (16.45) 
Then we have 


Lo(f) = —Uo(-f). (16.46) 


Now one can apply Theorem 16.14 in its setting to find Lo(f). 
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16.4 Applications — Discussion 


The above described moment theory methods have a lot of applications in many 
sciences. To mention a few of them: Physics, Chemistry, Statistics, Stochastic 
Processes and Probability, Functional Analysis in Mathematics, Medicine, Material 
Science, etc. Moment theory could be also considered the theoretical part of linear 
finite or semi-infinite programming (here we consider discretized finite non-negative 
measures). 

The above described methods have in particular important applications: in the 
marginal moment problems and the related transportation problems, also in the 
quadratic moment problem, see Kemperman (1986), ({234]). 

Other important applications are in Tomography, Crystallography, Queueing 
Theory, Rounding Problem in Political Science, and Martingale Inequalities in Prob- 
ability. At last, but not least, moment theory has important applications in esti- 
mating the speeds: of the convergence of a sequence of positive linear operators to 
the unit operator, and of the weak convergence of non-negative finite measures to 
the unit-Dirac measure at a real number, for that and the solutions of many other 
important moment problems please see the monograph of Anastassiou (1993) ,([20]). 


Chapter 17 


Extreme Bounds on the Average of a 
Rounded off Observation under a Moment 
Condition 


The moment problem of finding the maximum and minimum expected values of the 
averages of nonnegative random variables over various sets of observations subject 
to one simple moment condition is encountered. The solution is presented by means 
of geometric moment theory (see [229]). This treatment relies on [23]. 


17.1 Preliminaries 


Here [-] and [-] stand for the integral part (floor) and ceiling of the number, respec- 
tively. 

We consider probability measures 4 on A := [0,a], a > 0 or [0, +00). We would 
like to find 


Ue = sup | GD cay) (17.1) 
and 
Ye int | OD an (17.2) 
subject to 
i iG Sa eS, (17.3) 
A 


where d > 0 (here the subscript a in Uy, Le stands for average). 

In this chapter, the underlined probability space is assumed to be nonatomic and 
thus the space of laws of nonnegative random variables coincides with the space of 
all Borel probability measures on R+ (see [302], pp. 17-19). 

The solution of the above moment problems are of importance because they 
permit comparisons to the optimal expected values of the averages of nonnegative 
random variables, over various sets of observations, subject to one simple and flexible 
moment condition. 

Similar problems have been solved earlier in [20], [83]. Here again the solutions 
come by the use of tools from the Kemperman geometric moment theory, see [229]. 
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Here of interest will be to find 


U := sup | ({t] + [t]) (dt) (17.4) 
woJA 
and 
L:= int [ ((t] + [t]) (de), (17.5) 
A 
subject to the one moment condition (17.3). Then, obviously, 
U L 
Us=5) ba=5: (17.6) 


So we use the method of optimal distance in order to find U, L as follows: 
Call 

M :== conyie a(t", ((t] + [#])). (17.7) 
When A = [0,a] by Lemma 2 of [229] we obtain that 0 < d < a, in order to find 
finite U and L. Then U = sup{z: (d",z) © M} and L =inf{z: (d",z) € M}. Thus 
U is the largest distance between (d’,0) and (d’,z) € M. Also L is the smallest 
distance between (d",0) and (d",z) € M. So we are dealing with two-dimensional 
moment problems over all different variations of A := [0,a], a > 0, or A := [0, +00) 
andr>lor0O<r<l. 

Description of the graph (a, ([z] + [2])), r > 0, a < +00. This is a stairway 
made out of the following steps and points, the next step is two units higher than 
the previous one with a point included in between, namely we have: the lowest 
part of the graph is Po = (0,0), then open interval {P; = (0,1), Qo = (1,1)}, 
the included point (1,2), open interval {Pz = (1,3), Qi = (2",3)}, included point 
(2”,4), open interval {P3; = (27,5), Q2 = (3",5)}, included point (3",6),..., {open 
intervals {Py41 := (k",2k +1), Qe := ((k +1)", 2k + 1)}, included points (k”, 2k); 
k = 0,1,2,3,...}, ..., keep going like this if a = +oo, or if 0 < a ¢ N then the 
last top part, segment of the graph, will be the open closed interval {([a]", 2[a] + 
1), (a”, 2[a] + 1)}, or if a € N then the last top part, included point, will be (a”, 2a). 
Here Pyy2 = ((k +1)", 2k +3). Then 


=~ 2 
slope(Pr+1Pk+2) = (k+)"— kr’ 


which is decreasing in k if r > 1, and increasing in k if 0 < r < 1. The closed convex 
hull of the above graph M changes according to values of r and a. 
Here X stands for a random variable and F for the expected value. 


17.2 Results 


The first result follows 
Theorem 17.1. Letd>0,r>1 and0<a<-+oo. Consider 
U := sup{ E([X]+ [X]):0< X <aas., EX" =d"}. (17.8) 
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1) ifk<d<k+1, ke {0,1,2,...,[a] 1}, a¢N, then 
_ 2d” + (2k + 1)(k +1)" — (2k +3)k" 
7 (k+1)" —kr 

2) Leta ¢ N and [a] <d <a, then 

U = 2[a] +1. (17.10) 

3) Let A = [0,+00), (4e., a= +00) andk <d<k+1,k=0,1,2,3,..., then 
U is given by (17.9). 

4) Letae N— {1,2} andk<d<k+1, where k € {0,1,2,...,a—2}, then U 
is given by (17.9). 

5) Leta € N anda—1<d<a, then 
a Pata 1) —(a=1) "2a 
7 a” —(a—1)" 


U (17.9) 


U (17.11) 
Proof. We are working on the (x,y) plane. Here the line through the points 
Priit, Petz is given by 
Qa" + (2k +1)(K +1)" — (2k 4+ 3)k" 
a ae 
In the case of a € N let P, := ((a—1)",2a—1). Then the line through P, and 
(a”,2a) is given by 


(17.12) 


x” +a"(2a—1)-—(a—1)'2a 


Baa (17.13) 


y <=) 
The above lines describe the closed convex hull M in the various cases mentioned 
in the statement of the theorem. Then we apply the method of optimal distance 
described earlier in Section 17.1. 


Its counterpart follows. 
Theorem 17.2. Letd>0,0<r<1and0<a<+oo. Consider (17.8), 
U := sup{E([X]+ [X]):0< X <aas.,EX" =d'}. 
1) LetO<a€WN and [a] <d<a, then 


U = 2a) +1. (17.14) 
2) LetO<a€N and0<d< |al, then 
2a) 

U = —d' +1. 17.15 

al" ean 


2*) Leta=1 and0<d<1, thenU =d" +1. 
3) Letae N—{1},0<d<aand0<r< $, then 


us (A) a. (17.16) 


a” 
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—L. 
I—r 


4) Letae N—{1},0<d<a-1,4<r<landa> Gr)" then 
(2r) T=" -1 
U =2(a—1)' Td" +1. (17.17) 
a 
5) Letae N—-{1},a-1l<d<a, $<r<l and a> a= , then 
(2r)1-r -1 
d" >) r 1)" r 
U= date ale a (17.18) 
a” —(a—1)" 
6) Let A = [0,+00), (t.e., a= +00), then 
U = +00. (17.19) 


Proof. We apply again the method of optimal distance. From the graph of 
(a”, ((z] + [x])) we have that the slope (Py4i1Pe42) = 
k; (see Section 17. 1). 

Let 0 < a ¢N, then the upper envelope of the closed convex hull M is formed by 
the line segments {P; = (0,1), Praj+1 := ((a]", 2[a] + 1)} and {Piaj4i, (a", 2[a] +1)}. 
Then case (17.1) is clear. 

The line (P; Piaj+1) in the (2", y) plane is given by 

,— 2a 
[a] 


2 Bas : é 
(eply"— kr is increasing 1 


a’ +1. (17.20) 


Therefore case (17.2) follows immediately. 
Case (17.6) is obvious by the nature of M when 0 <r <1. 
Next let ae N— {1} and0<r< 4. That is., 


1l-r 
xr<i<(—.) , for any € € (a—1,a). 
i 
That is 2(a—1)!-" < et: any € € (a—1,a). Thus by the mean value theorem 
we find 

1 
a’ —(a—1)" 
Therefore slope(mo) < slope(m1), where mo is the line through the points P,; = 
(0,1), Pa = ((a—1)",2a—1), and m, is the line through the points P,, (a",2a). In 
that case the upper envelope of M is the line segment with endpoints P, = (0,1) 
and (a”,2a). This has equation in the (x",y) plane 


a= 1 
v=( _ Jorn. (17.22) 
a 


%a—1)*" < (17.21) 


The last implies case (17.3). 
Finally let a € N— {1} and $ <r <1, along with 
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Therefore 


a(@n"F—1)> QnNTF and alr) - Qnty >a. 


That is (2r)™*(a—1) >aand 


(4) < (2r). 


1l-r 
That is, (=) < 2r and €1—" < al" < (a—1)!""2r, any € € (a—1,a). Thus 
El-” < 2r(a—1)1~, and ae < 2(a—1)'"", any € € (a—1,a). And by the mean 
value theorem we find that 
1 
slope(™m1) = af =r SS slope(mo) = 2(a — Ly" 


where again my, is the line through P, = ((a — 1)",2a — 1) and (a”, 2a), while mo 
is the line through P; = (0,1) and P,. Therefore the upper envelope of M is made 
out of the line segments P,P, and P,(a",2a). The line mo in the (", y) plane has 


equation 
y = 2(a—1)' "2" +1. (17.23) 
Now case (17.4) is clear. 
The line m in the (a, y) plane has equation 
x’ + 2a(a" —(a—1)")—a" 


y= i etek (17.24) 


At last case (17.5) is obvious. 


Next we determine L in different cases. We start with the less complicated case 
where 0 <r<l. 


Theorem 17.3. Letd>0,0<r<1and0<a<+oo. Consider 
L:=inf{E([X]+[X]):0< X <a as, EX" =d'}. (17.25) 
1) Let 1 <a¢N and0<d<1, then 
L=d. (17.26) 


1*) Leta<1,0<d<a, thn L=a'd’. 
2) Let2<aG¢Nandk+1<d<k+2 fork e€ {0,1,...,[a] —2}, then 
_ 2d" + (2k+1)(K4+ 2)" — (2k 4+ 3)(k +1)" 


é re eee (17.27) 
3) Let 1 <a€N and [a] <d <a, then 
i= 2d” + a" (2[a] — 1) — [a}"(2[a] + 1) (17.28) 


a" — fal” 
4) Letae N andO<d< 1, then 
L=@. (17.29) 
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5) Leta ee N— {1} andk+1<d<k+2, where k € {0,1,...,a—2}, then 
L is given by (17.27). 
6) Let A := [0,+00) (i.e., a= +00) and0<d< 1, then 
baw (17.30) 


7) Let A:= [0,+00) andk+1<d<k+2, where k € {0,1,2,...}, 
then L is given by (17.27). 


Proof. The lower-envelope of the closed convex hull M is formed by the points 
Po = (0,0), {Qk = ((k + 1 2k + 1); k= 0, 1, 2,3, os sb Qiaj—1 = ({a]”, 2[a] = 1), 
F := (a",2{a]+ 1). Here 


2 
slope(QkQr+1) = (k+2)"—-(F+D"’ 


which is increasing in k since 0 < r < 1, and slope(PoQo) = 1. Notice that 
slope(QoQ1) = 5-4 > 1. When 0 < a ¢N we see that 


slope(Qjaj—1f") > slope(Qjaj—2Q|aj-1)- 
Therefore the lower envelope of M is made out of the line segments PoQo, QoQ1, 
Q1Q2,---, QeQk+15-- ++ Qaj—2Q|aj—1 and Qjaj_iF’. In the cases of a € N or a = +00 
we have a very similar situation. Cases (17.1), (17.4) and (17.6) are clear. 
The line (QxQx+1) has equation in the (a", y) plane 
Qa? + (2k + 1)(k +2)" — (2k + 3)(k +1)" 
7 (k +2)" —(k4+1)" 
Now the cases (17.2), (17.5) and (17.7) follow easily by the principle of optimal 
distance. 
Finally the line (Qjqj-1/’) in the (#", y) plane has equation 
_ 2a" + a"(2[a] — 1) — [a]"(2[a] + 1) 
y= a’ — [a]" 


(17.31) 


(17.32) 


Thus, case (17.3) now is obvious by applying again the principle of optimal (mini- 


mum) distance. 


The more complicated case of L for r > 1 follows in several steps. Here the 
lower envelope of the closed convex hull M is formed again by the points Py = (0,0), 
Qo = (1,1),...,Q% = ((K+1)", 2k4+1),..., Qjaj—1 = ([a]”, 2[a]—1), F = (a”, 2[a]+1) 
or F* = (a",2a) if a € N. In the case of A = [0,+00), the lower envelope of M is 
formed just by Po, Qx; all k € 71. Observe that the 

= 2 
slope(QeQk41) = (+2) —(k+1)" 


is decreasing in k since r > 1. 
Theorem 17.4. Let A = [0,+00), (te, a= +00) r>1 andd>0. Consider 
L := inf{E((X]+[X]): X >0as., BX" =d'}. (17.33) 
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We meet the cases: 
1) Ifr >1 we get that L=0. 
2) Ifr=1and0<d<1, then L=d. 
3) Ifr=1 and1<d<-+ov, then L = 2d-1. 


Proof. We apply again the principle of optimal (minimum) distance. 
When r > 1 the lower envelope of M is the «’-axis. Hence case (17.1) is clear. 
When r = 1, then slope(Q;Qz4i1) = 2 any k € Zy and slope(PoQo) = 1. 

Therefore the lower envelope of M here is made by the line segment (PoQo) with 

equation y = x, and the line (QoQ...) with equation y = 2% — 1. Thus cases (17.2) 

and (17.3) are established. 


Theorem 17.5. Leta € N— {1}, d>0 andr >1. Consider L as in (17.25). 
1) pe >1 and0<d<1, then 


paw: (17.34) 
2) rp 2 > and1<d<a, then 


= 2(a — 1)d” + (a” — 2a +1) 


17. 
—— (17.35) 
3) If 220 <1 and0 <d<a, then 
2a-—1 
b= ( “— Ja (17.36) 
a 


Proof. Here mo := line(PoQo) has equation y = x” and slope(mo) = 1. Also 
my, := line(QoQa-1), where Qa_1 = (a", 2a — 1), has equation 


_ 2(a—1)z" + (a” —2a+1) 


: (17.37) 
a” —1 
with slope(m1) = 2ot). 
Notice also that slope(Q.Qx+41) is decreasing in k. If 2a~-1) > 1, then the lower 


envelope of M is made out of the line segments P9Qo and QoQa_1. Hence the cases 
(17.1) and (17.2) are now clear by application of the principle of minimum distance. 
If Ata) <1, then the lower envelope of M is the line segment (PoQa_1) with 


y= (= *) as (17.38) 


associated line equation 
a” 


Thus case (17.3) is established similarly. 
Theorem 17.6. Let2<a¢N, n:=[al,d>0 andr>1. Suppose that 
p< 2m)? 


~ n?—-1 ~— avn 


(17.39) 


Consider L as in (17.25). 
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1) [f0<d<1, then 
bod, (17.40) 
2) Ifl1<d< [a], then 


2({a] — 1)d" + ([a]” — 2[a] + 1) 


io al" — 1 


(17.41) 


_ 2d" + a®(2[a] — 1) — [a]"(2[a] + 1) 


a” — [al" 


(17.42) 


Proof. The lower envelope of M is made out of the line segments: {P) = (0,0), 
Qo = (1,1)} with equation y = x", {Qo = (1,1), Qiaj-1 = ([a]", 2[a] — 1)} with 
equation 
2(la] = La" + ([a]” = 2la] + 1) 
[a" - 1 
and {Qtaj—1, F = (a",2[a] + 1)} with equation 
_ 2a" + a"(2[a] — 1) — [a]"(2[a] + 1) 
y= are [a]” . 


Y= 


Then cases (17.1), (17.2), (17.3) follow immediately by application of the principle 
of minimum distance as before. 


Theorem 17.7. Let2<a¢N, n:= [a], d>0 andr>1. Suppose that 
2(n — 1) % 2 


> é 17.4 
12 n—1 ~ a —nt ce) 
Consider L as in (17.25). If0 <d<a, then 
Es (a) a. (17.44) 
a” 


Proof. Here the lower envelope of M is made out of the line segment {Po = (0,0), 
F = (a”,2[a] +1)} with equation 


(a) 


Theorem 17.8. Let2<a¢N, n:= [a], d>0 andr >1. Suppose that 


etc. 


- 2 
2(n—-V < <1 (17.45) 
nr —1 a’ — nr 
and 
2n—1 2 
Lae (17.46) 


nr ar —n 
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Consider L as in (17.25). 
1) If0<d< |al, then 


ee Ge = -) d”. (17.47) 


_ 2d" + a" (2{a] ~ 1) = [a)"(2la) +1) 


a” — [a]" 


(17.48) 


Proof. Here the lower envelope of M is given by the line segments: {Pp = (0,0), 
Qiaj—1 = ([a]", 2[a] — 1)} with equation 


v= (AE) 


and {Qjqj-1, F = (a",2[a] + 1)} with equation 


ne 2x" + a™(2[a] — 1) — [a]”(2[a] + 1) (17.49) 
a’ — fal” 
Theorem 17.9. Let2<a¢N, n:= [a], d>0 andr >1. Suppose that 
2(n—1 2 
an=t) <1< , (17.50) 
nt —1 ie te 


Consider L as in (17.25). Then L is given exactly as in cases (17.1) and (17.2) of 
Theorem 17.8. 


Proof. The same as in Theorem 17.8. 


Theorem 17.10. Let 2<a¢N,n:=[a],d>0 andr >1. Suppose that 


2 2(n — 1) 
< —— 17.51 
aon =) Sarr ey) 
and 
eer (17.52) 
ar —1 ; ; 
Consider L as in (17.25). 
1) If0<d<\1, then 
L=d’. (17.53) 
2) If1<d<a, then 
2\a|d” "—2lal—1 
L= 2lala” + (a" — 2[aj — 1) (17.54) 


a” —1 


Proof. Here the lower envelope of M is made out of the following line segments: 

{Po = (0,0), Qo = (1, 1)} with equation y = x”, and {Qo, F = (a”, 2[a] + 1)} with 

equation 

2[a]a” + (a” — 2[a] — 1) 
a’ —1 


y= ; (17.55) 
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etc. 


Theorem 17.11. Let 2<aZ¢N, n:= [a], d>0 andr >1. Suppose that 
2. An-1) 


a —n nr— 


1< 


(17.56) 


Consider L as in (17.25). Then L is given exactly as in cases (17.1) and (17.2) of 
Theorem 17.10. 


Proof. Notice that the lower envelope of M is the same as in Theorem 17.10, 
especially here 24 > 1. Etc. 


ar—1 
Theorem 17.12. Let2<a¢N,n:= [a], d>0 andr >1. Suppose that 
2(n—1 2 
202) na 1 (17.57) 
nr —1 a” —n" 
and 
2n-—1 2 
= (17.58) 
n av —n 
Consider L as in (17.25). 
If0<d<a, then 
2 1 
E= Geo) a’. (17.59) 
a 
Proof. Here the lower envelope of M is the line seement PyF with equation 
2 1 
- (a) Ce (17.60) 


etc. 


Theorem 17.13. Let2<a¢N,n:= [a], d>0 andr >1. Suppose that 


2 2(n — 1) 
<1< 17.61 
av’ —nr — — nr—] (17.61) 
and 
2n 
<1. (17.62) 
a’ —1 
Consider L as in (17.25). 
If0<d<a, then 
2 1 
ie ( + ja (17.63) 


Proof. Same as in Theorem 17.12. 
Proposition 17.14. Leta=1,r>1and0<d<1. Then L=d’. 


Proposition 17.15. Let0O<a<1,r>1and0<d<a. Then L=a"d’. 
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Proposition 17.16. Leta=2,r> pn andQ0<d<2. Then 
L=3-27"d". 


Proposition 17.17. Leta=2,1<r< me, 
1) If0<d<1, then L=d'. 


2) f1<d<2, then L= 24. 


Proposition 17.18. Let 1<a<2,r>1 and 2(a"—1)7"! >1. 
1) If0<d<1, then L=d'. 
2) Ifl1<d<a, then L = 240-3. 


a™—1 
Proposition 17.19. Let 1<a<2,r>1 and 2(a"—1)7! <1. 
If0<d<a, then L =3a~"d’. 
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Chapter 18 


Moment Theory of Random Rounding 
Rules Subject to One Moment Condition 


In this chapter we present the upper and lower bounds for the expected convex com- 
binations of the Adams and Jefferson roundings of a nonnegative random variable 
with one moment and range conditions. The combination is interpreted as a ran- 
dom rounding rule. We also compare the bounds with the ones for deterministic 
rounding rules at the respective fractional levels.This treatment follows [89]. 


18.1 Preliminaries 


For a given » € (0,1), we define a function f(x) = Ala] +(1—A)[a],  € R, where 
|x| and [a] denote the floor and ceiling of x, respectively. Note that f,(x) can 
be interpreted as the expectation of rounding x according to the random rule that 
rounds down and up with probabilities \ and 1 — A, respectively. 

Denote by X an arbitrary element of the class of Borel measurable random 
variables such that 0 < X < a almost surely and EX” = m,. for some 0 < a < +00, 
and r > 0, and 0 < m, < a’. The objective of this chapter is to determine the 
sharp bounds 


U(m,) = U(A, a, 7, Mr) = sup Efy(X), 
L(m,) = LO, a,r,m,) = inf Efy(X), 


for all possible choices of parameters, where the extrema are taken over the respec- 
tive class of random variables. 

The special cases of the Adams and Jefferson rules, 4 = 0 and 1, respec- 
tively, were studied by Anastassiou and Rachev (1992),[83] (see also Anastassiou 
(1993, Chap.4)),{[20] . The analogous results for ) = 1/2 can be found in Anas- 
tassiou (1996),[23] and Chapter 17 here. The method of solution, that will be 
also implemented here, is based on the optimal distance approach due to Kemper- 
man (1968),[229]. In our case, it consists in determining the extreme values of the 
second coordinate of points (m,, xz) belonging to the convex hull H = H(),a,r) of 
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the closure G = G(A, a,r) of the set {(t", fx(t)): O<t < a}. Clearly, 


n-1 N 
G= J{.k+1-d): ks ts k+1pu LJ {(K",’)} 
k=0 k=0 


U{(t", n+1—-A): n<t<at}, 
where n = sup{k € NU {0}: & < a} and N = |a| (=n unless a € N). It will be 
convenient to use both the notions n and N in describing the results. The planar 
set G can be visualized as a combination of horizontal closed line segments U;,Ly41, 
0 <k < N, and possibly Uy A, and separate points S;, 0 < k < N, say, where 
U, = (k",k+1—A), Le = (k",k — A), A = ("5 n+1-—A), and Sy = (k",k). 
The purpose is to determine, for fixed A, a and r, the analytic forms of the upper 
and lower envelopes of H, U(m,) and L(m,), 0 < m, < a, respectively. Since 
Sy € LU, we can immediately observe that 
conv {So,Uo,11,...,Ln,Un, A}, if a€R,\N, 
H=<¢ conv {59,Uo,11,...,Ln,Un, La, Sa}, if ae N, (18.1) 
conv {So, Uo, L1,...,; Dr, Uk,..-}, if a= +00. 

A compete analysis of various cases, carried out in Section 18.2, will allow us to sim- 
plify further representations (18.1) E.g., the shape of H is strongly affected by the 
fact that the length of U, L441 is constant, decreasing and increasing for r = 1, r < 1 
and r > 1, respectively, as k increases. The argument makes reasonable considering 
the cases separately in consecutive subsections of Section 18.2. In each subsection, 
we derive both the bounds by examining more specific subcases, described by vari- 
ous domains of the end-point a. In Section 18.3 we compare the bounds for E'f,(X) 
with those for respective Ef\(X), 0 < \ < 1, where fX(x) = |e +1-— Al] is the 
deterministic A-rounding rule, and X belongs to a given class of random variables, 
determined by the moment and support conditions. Note that both the f, and 
fx round down and up the same proportions of noninteger numbers. The bounds 
for the latter were precisely described in Anastassiou and Rachev (1992),[83] and 
Anastassiou (1993),[20]. 


18.2 Bounds for Random Rounding Rules 


18.2.1. Case r = 1. Here L; € LL, and U;€ U,U; for all i < 7 < k. Thus we 
can significantly diminish the number of points generating H (see (18.1)). If a is 
finite, then H is a hexagon with the vertices $9,Uo,Un,A,Ln,L1 for a ¢ N and 
So, U0, Un, Sa, La, 1 for ae N. If a = +00, then AH is the part of an infinite strip 
and has the borders UgSo, Sol and the parallel halflines UpU;~ and 1, L5~. In 
Theorem 18.1 and 18.2 we precisely describe U(m,) and L(m1), respectively. 


Theorem 18.1. (i) If NZ a< o, then 
my +1-—A, forO<m, <N, 


ua ae 1—A, for N<m, <a. ee) 
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(ii) If N 3 a < ow, then 


vine (tettak freemen a 
(iii) If a = +00, then 
U(m) =m, +1-A, for m, > 0. (18.4) 
Theorem 18.2 . (i) [f0<a<1, then 
L(m) =(1—Aja~* mM, ~— for OX m <a. (18.5) 
(ii) If 1 <a<o, then 
(1—A)m, for0<m, <1, 
L(m1) = « m1 —A, forl<m,<n, (18.6) 
aftn+1l—d, forn<m, <a. 
(iii) If a= +00, then 
Him {OrAmtosm ss gan 


We indicate the supports of distributions providing bounds (18.2), (18.7) (possibly 
in the limit). We have U(m1) = m; +1 -— A in (18.2), (18.4) for combinations of 
t \, k, the limits being integer such that 0 < k <a. The latter formulas in (18.2) 
and (18.3) are obtained for mixtures of any t € (n, a], andt \, n with a, respectively. 
Two-point distributions supported on 0 and a provide (18.5). We obtain the first 
bounds in (18.6),(18.7) for mixtures of 0 and t 7 1. The second are achieved for 
those of arbitrary t “7k, k =1,...,N. Finally, the last bound in (18.6) is attained 
once we take combinations of t 7 N and a. The supports can be easily determined 
by analyzing the extreme points of the convex hull H. We leave it to the reader to 
establish the distributions, possibly limiting, that attain the bounds for r ¥ 1. 


18.2.2. Case 0 < r <1. The broken lines determined by So, 11,...,L£n,A and 
Uo, U,,...,Un are convex. This implies that the lower envelope of H coincides 
with the former one (see (18.13) below). In the trivial case a < 1, this reduces 
to SoA (see (18.12)). Also, Ui,...,Un—1 do not belong to the graph of U(m,), 
which, in consequence, becomes the broken line joining Uo,U, and A for a finite 
noninteger a (cf (18.8)). For a € N \ {1}, we should consider two subcases. If 
is sufficiently small, then the slope of UoS, is less than that of UopU, (precisely, if 
(a—1+A)a~" < n'~") and therefore U(m,) has two linear pieces determined by 
the line segments UpU, and U,,Sq (cf (18.10)). Otherwise we obtain a single piece 
connecting Up with S, (see (18.9)). The same conclusion evidently holds when 
a= 1. Examining the representations of U and L as a — +00, we derive the 
solution in the limiting case a = +00 (see (18.11) and (18.14)). 
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Theorem 18.3. (i) If a finite a ¢ N, then 


ine) N'-'m, +1—., for0<m, <N’, 
Mr —s 
N+1-), for N° <m, <a’. 


(ii1) If ae N and X>n'~"a" +1—a, then 


U(m,) =(a-1+A)a"m,+1—, for 0<m, <a’. 


(itz) If ae N and X< n'a" +1—a, then 
{Sia +1-—, forO<m,. <n’, 


Ae ta, forn” <m, <a’. 


av’ —n” 


U(m,) = 


(iit) If a = +00, then 
U(m,) =+c0, form, >0. 


Theorem 18.4. (i) If0<a<1, then 
L(m-) =(1—A)a7"m,,  forO0<m, <a". 
(ii) If 1 <a< +00, then 


(1—A)m,, for 0<m, <1, 


Mrp—k" a rl 
Heppner ee OS LORS eRe), 
andk =1,...,n—1, 


ni ae t+n+1-—-A, for n™ <m, <a’. 


(iti) If a = +00, then 
(1—A)m,, for0<m, <1, 
L(mr) = 4 Gee +k A, for kh <m, <(k+1)", 


and k =1,2,.... 


18.2.3. Case r > 1. 
Theorem 18.5. (i) If a is finite noninteger, then 


Tepe tk+1—A, for k” < my < (k+1)", 
U(m,) = andk =0,...,N—1, 
N+1-)A, for N°<m. <a’. 
(ti) If a is finite and integer, then 
Rie tk+1-A, for k” <m, < (k+1)’, 
U(m,) = andk =0,...,n—1, 
Mma") 4 q, for n™ <m, <a’. 


(iti) If a is infinite, then 


Mr—k" a : 
U(m,) = Ceprae tk+1—A, fork <m, <(k+1)’, 
andk =0,1,.... 


(18.8) 


(18.9) 


(18.10) 


(18.11) 


(18.12) 


(18.13) 


(18.14) 


(18.15) 


(18.16) 


(18.17) 
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Bounds (18.15), (18.16) and (18.17) are immediate consequences of the fact that 
the broken lines with the knots Uo,...,Un,A for a € N, Uo,...,Un,Sq for a EN, 
and Uo, U;,... for a = +00 are concave. For the special cases a < 1 and a = 1, 
yields Up = Uy, and so the upper bounds reduce to the latter formulas of (18.15) 
and(18.16), respectively. 


Theorem 18.6. (i) If0 <a< 1, then (18.12) holds. 
(ii1) If 1 <a<2 and (2—A)a" < 1-4, then 


L(mr) =(2-A)a"m,, forO<m, <a". (18.18) 
(tig) If l<a<2 and (2—A)a~" > 1-4, then 


_ fd—-A)m,, for0<m, <1, 
POG) = { A, for 1 <m, <a’. (18.19) 
Suppose that 2<a< +oo. 
(itt) If (n+1—A)a™ < min{1l— A, (n—A)n~}, then 
L(m,) =(n+1—-A)a-"m,, for 0<m, <a’. (18.20) 
(iti2) If (n-—A)nT < (n+ 1—Aj)a" and (n— A)n~" <1—4, then 
_f GR An me, for0<m, <n", 
Lie) = | nna Gh e ROR ea (18.21) 
(itiz) If 1-—2A< min{(n+1— Aja", (n—A)n7 < 2+, then 
(1 —A)m,, forO<m, <1, 
L(m,) = nmr : 
(m+) alm) +1—A, for 1<m, <a’. 1822) 
(itig) If 1—X < min{(n+1—A)a™, (n—A)n-"} and 4 > 4, then 
LS es for0O<m, <1, 
L(m,) =< = + (mp —1)+1—4, for 1 <m, <n" (18.23) 
met +n+1—X, forn™ ie bf. 
(iv) If a= +00, then 
L(m,) =0, for m,>0. (18.24) 
We start by examining case (ii). Since L1,..., 0, generate a concave broken 


line, it suffices to analyze the mutual location of So = (0,0), Zi = (1,1 — A), 
Ly, = (n",n— X) and A = (a",n+1-—A). This means that we should consider 
four shapes of the lower envelope determined by the following sets of knots So, A, 
and So, Ln, A, and So, £,,A and So, £1, Ln, A, respectively. We show that the all 
cases are actually possible. First fix \ and n. If a \, n, then the slope of L,,A can 
be arbitrarily large. This implies that the slope of SoA, equal to (n+ 1—A)a~", 
becomes greater than that of SoLn, which is (n — A)n~". The same conclusion 
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holds with So replaced by Ly, for which we derive n/(a” —1) > (n—1)/(n" —1). If 
a /7n-+1, then A— Ly41, and, in consequence, SoA and LA run beneath SoLn 
and L,L,,, respectively. 

If we fix a, and let vary, we do not affect the mutual location of L1,..., Dy 
and A. For \ 7 1, the slope of Sol, equal to 1 — A, becomes smaller than either 
of SoLn and SoA. If A \, 0, this can be arbitrarily close to 1. Then this becomes 
greater than either of n'~" and (n+1)a~"(< a!~"), the limiting slopes of SoL, and 
‘SoA, respectively, since r > 1. Letting \ vary from 0 to 1, we meet the cases for 
which 1 — lies between (n + 1 — A)a~" and (n — A)n~". This shows that all four 
cases are possible: 


(iit,) the slope of SoA is minimal among these of SoA, SoLn and SoLj, and (18.20) 
is the linear function determined by So and A, 

(itt2) the slope of SoL, is less that that of SoA and not greater than that of SoL1, 
and (18.21) has two linear pieces, with the knots So, L, and A, 

(iti3) we have three knots So, L, and A (see (18.22)) if the slope of SoL, is minimal, 
and, moreover, the slope of £1 A is not greater than that of Li Ly, 

(iti4) bound (18.23) has three linear pieces with the ends at So,L1,L, and A, if 
the slope of SoLy is less than either of SoL, and SoA, and the same holds for 
the pair L;L, and L,A. 


Ifl<a< 2, then Ly; = Ly, and the problem reduces to two cases: we have (18.18) 
if the slope of SoA does not exceed the slope of Soli, and (18.19) otherwise. If 
a <1, then n = 0, and the lower envelope of H is S9A. When the support of X is 
unbounded, it suffices to take a sequence of two-point distributions with the masses 
1—m,a~" and m,a~" at 0 and N 7a — +00. Then E'f)(X) coincides with either 
the right-hand side of (18.20) or the former formula in (18.21), which vanish as 
a — +o0 and so (18.24) holds. 


18.3. Comparison of Bounds for Random and Deterministic Round- 
ing Rules 


In order not to blur the general view, we concentrate on the problems of practical 
interest for which the right end-point a is sufficiently large. If the rounded variables 
take on one or two values only, we derive simple results, but usually completely 
dissimilar to those for larger a. For the same reason, we rather confine ourselves 
to indicating qualitative differences between the bounds of the random and de- 
terministic roundings. The reader that will be interested in obtaining the precise 
comparison of the bounds for a specific choice of parameters, is advised to find the 
respective formulas in Section 18.2 and in Anastassiou (1993, Section 4.1),[20]. 
Fixing A, a and r, set Li = ((k+ A)", k), Uf = (K+ A)", k +1), Ke NU {0}, 
A* = (a",n+1) and A, = (a",n). Note that A lies above A, and below A*. 
We first recall the fact that the upper and lower bounds (denoted by U* and L*, 
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respectively) for the expectation of the deterministic \-rounding rule are determined 
by the upper and lower envelopes of the convex hull H* of the set 


Sola U Ugo UF LE, UUZA*, = for a> n+. 


. oo Ax, for a<n+A, 


Below we shall repeatedly make use of the facts that all L; and Lj belong to the 
graph of the function t + t!/" — A, and the same holds for U; and Uf and the 
function t + t!/" +1 —. The shapes of the functions depend on the moment order 
parameter r > 0. As in the previous Section 18.2, we study there cases. 


18.3.1. Case r =1. Taking a large enough, we can see that L = L* and U = U* in 
some intervals of m,. In particular, for a = +oo, the domain of m, for which the 
roundings are equivalent is an infinite interval. We present now some final results, 
leaving the details of verification to the reader. 

We first compare the upper bounds under the condition a > 1. It easily follows 
that U(mi) > U*(m1), when 0 < mj, < A. Furthermore, U(m,) = U*(m) for 
A<m, <n+1—A when a < n+A, and for A < m, < n otherwise. (Clearly, n = +00 
if a = +co.) In the former case, we have U(m1) > U*(m,) forn+1—-A<m <a, 
and the reversed inequality for n < mj, <a in the latter one. 

We now treat L and L* in the casea > 14+. For 0 < m, < 1 we have 
L(m,) > L*(my1). Ifa < N+A, then L(m,) = L*(m1) for 1 < my < N—1+A, and 
otherwise the equality holds for 1 <_m, < N. If m, is close to the upper support 
bound, then LZ and L* are incomparable, except of the special cases a = N and 
a= N+ 2 for which L < L* in (N+1-—4,a) and L > L* in (N,a), respectively. 
IfNZa<N+4A, then L — L* changes the sign from — to + in (N +1-—4,a). If 
a> N+A, then L — L* changes the sign from + to — in (N,a). 

Observe that U dominates U* for all possible moment conditions iff either a < 
n+. or a=-+oo, and L dominates L* iff a= N+. and a=-+oo. Generally, the 
deterministic rule fy has a smaller expectation for small m,. For large ones, the 
relations between the bounds strongly depend on the relation between the fractional 
part of a and the rounding parameter 4X. 


18.3.2. CaseO<r<l. If0<m, <A’, then 0 = L*(m,) < L(m,). Observe that 
L,, and Lj, belong to the graph of a strictly convex function. This implies that each 
L, and Ly lie under L%_, Li and Ly Lx+1, respectively, and, in consequence, L(m,) 
and L*(m,) cross each other exactly twice in every interval (k”,(k+1)"). The former 
function is smaller in neighborhoods of integers and greater for m, ~ (K+ A)”, with 
k © NU{O}. At the right end of the domain, if a < n+ A, then L* ¢ H* and 
L* ends at A,, situated beneath A. It follows that ultimately L(m,.) > L*(m,), as 
m, / a’. Ifa<n+dA, then the last linear piece of L* is L* A* and so L* is greater 
in the vicinity of a”. Generally, the mutual relation of ZL and L* is similar to that 
for r = 1 expect of that in the central part of the domain, where the equality is 
replaced by nearly 2N crossings of the functions. 
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We now examine the upper bounds. Suppose first that a € N. Then the upper 
envelope of H is determined by Uo, A and possibly U,,, and that of H* by So,U;7, A 
and possibly UG. Since Up, Ug, Un and UF are points of the graph of a convex func- 
tion, UgU;, crosses Us0e Also, the straight lines determined by Uo, Up, and Ug, UF 
run above the points So, U§ and U,, A, respectively. Hence, if the upper envelopes 
of H and H* actually contain U, and Uj, respectively, they cross exactly once for 
some m, € (A",n). If H does and H* does not, then the line running through 
So,U* and under Uj crosses once the concave broken line joining Uo,U, and A. 
The same arguments can be used, when Uj is an extreme point of the respective 
hull, and U,, is not. Also, the line segments UjA and SoU* have a crossing point. 
In conclusion, U > U* for small arguments and U < U* for large ones. 

The same holds true for noninteger a > n+. Here Uy is actually a point 
generating H and we can drop two of four subcases in the above considerations. If 
NZa<n+A, the knots of U* are So,U*_,, As and possibly Uj. Now UZU*_, is 
situated beneath UpU,,. Since, moreover, Sg lies under Up and A, lies under A, we 
immediately deduce that U > U* for all m, < a". Finally, for a = +00, we have 
U =U* =-+o. 


18.8.3. Case r > 1. We first study the upper bounds. Here U; lies above OS uF 
and U; lies above U,U,41 for arbitrary integer k, because the above points are 
elements of the graph of a strictly concave function. In consequence, the intervals 
in which U > U* contain m, = k", and are separated by the ones containing 
m, =(k+A)", where U < U*. At the left end, U(m,) = 1— > U*(m,) \, 0, as 
m, \, 0, like in the other cases. At the right one, we consider two subcases. For 
a >n-+A (a — possibly integer), U*(m,) =n+1>U(m,), as m, is close to a’, 
and the last crossing point of U and U* belongs to (n”,(n+A)"). Ifa < n+A, then 
U(m,) =n+1—2X> U*(m,) =n for n” < m, < a", and the last crossing point 
lies in ((n —1)",(n—1+ )"). This is also an obvious analogy with the cases r < 1. 

It remains to discuss the most difficult problem of the lower bounds for r > 1. 
There are various possibilities for the lower envelope of H. This may have four 
knots So, £1, 2, and A, merely the outer ones being obligatory. Assuming that 
a >n-+ 4, there are two possible envelopes of H*: one determined by So, Lo, Li, 
and A*, and the other with L* dropped. Since ZL, is located above Li L*, in 
the former case So, L; and L,, belong to the epigraph of L*, and A does not. This 
implies that Z and L* cross once in (0,a"), for all possible forms of ZL. The same 
conclusion holds for the latter case, where the slope of Le At is not greater than 
that of L§L*, and So, Lo, Ln remain still separated from A by Te A*. Therefore for 
a>n-+X, L(m,) is less than L*(m,) for small m,, and greater for the large ones. 
In fact, the crossing point is greater than n” so that the former relation holds in an 
overwhelming domain. 

If a < n+, then the lower envelope of H* is determined by either 
So, Lo, L,_1, Ax or So, £5, Ax. In the four-point case, we can check that So, Li 


n—1> 
and A lie above the envelope and L,, does not. Hence, if ZL, is an extreme 
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point of H (see Theorem 18.6 (iti2), (iti4)), then L — L* changes the sign from + 
through — to +, and is positive everywhere otherwise. In the three-point case, 
Ln may also be located in the epigraph of L* as So,£, and A are. Therefore 
an additional condition for the two sign changes of ZL — L* should be imposed. 
This is (n — A)(a” — A") < n(n” — A"), which is equivalent with locating L, un- 
der TAs: Otherwise L dominates L* for all m, <a". The final obvious conclusion 
is L(m,) = L*(m,) = 0 for 0 < m, < +00 = a. 
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Chapter 19 


Moment Theory on Random Rounding 
Rules Using Two Moment Conditions 


In this chapter sharp upper and lower bounds are given for the expectation of 
a randomly rounded nonnegative random variables satisfying a support constraint 
and two moment conditions. The rounding rule ascribes either the floor or the 
ceiling to a number due to a given two-point distribution.This treatment follows 
[85]. 


19.1 Preliminaries 


Let X be arandom variable with a support contained in the interval [0, a] for a given 
0<a< +c and two fixed moments EX =m, and EX" =m, forsome0 <r #1. 
Assume that the values of X are rounded so that we get the floor 


|X| =sup{keZ: k< XxX} 
and the ceiling 
[X] =inf{keZ: k> xXx} 
with probabilities \ € [0,1] and 1—A, respectively. Our aim is to derive the accurate 
upper and lower bounds for the expectation of the rounded variable 
U(A,a,7,m1,M,) = sup E(ALX| + (1 — A)[X)]), (19.1) 
L(A, a,7,7m1,m,) = inf E(ALX| + (1 — A)[X)), (19.2) 


respectively, for all possible combinations of the arguments of the left-hand sides 
of (19.1) and (19.2). The special cases of the problem with A = 0 and A = 1, cor- 
responding to well-known Adams and Jefferson rounding rules, respectively, were 
solved in Anastassiou and Rachev [83] and Rychlik [313] (see also Anastassiou 
(20,Chapter 4]). In fact, in the above mentioned publications there were exam- 
ined more general deterministic rules that round down unless the fractional part 
of a number exceeds a given level A € [0,1], which surprisingly leads to radically 
different bounds from these derived for the respective random ones. The latter were 
established in the case of one moment constraint EX" = m,, r > 0, by Anastassiou 
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and Rachev [83] for A = 0,1, and Anastassiou [23] for \ = 1/2 and Anastassiou and 
Rychlik [89] for general X. 

A general geometric method of determining extreme expectations of a given 
transformation of a random variable, say g(X) (here g,(x) = Ala] + (1 — A)fa]), 
with fixed expectations j11...,~4, of some other ones f1(X),..., f¢(X), say, (here 
fila) = x, fo(a) = x”) was developed by Kemperman [229]. The crucial fact the 
method relies on is that any random variable X with given generalized moments 
Ef;(X) = wi, i = 1,...,k, can be replaced by one supported on & + 1 points at 
most that has identical expectations of each f; (see Richter [307], Rogosinsky [310]). 
Therefore it suffices to confine ourselves to establishing the extremes of the(k + 1)- 
st coordinate of the intersection of the convex hull of (fi(x),..., fx(%), g(x)) for 
all x from the domain of X, which is the space of all possible generalized moments, 
with the line « +> (t,...,x%,2), representing the moment conditions. If X has 
a compact domain, and all f;, 2 =1,...,k, are continuous and g is semicontinuous, 
then either of extremes is attained (see Kemperman [Theorem 6, 229]). If g is 
not so, the geometric method would still work if we consider the extremes of g in 
infinitesimal neighborhoods of the moment points (see Kemperman [Theorem 6, 
229]). 

We use the Kemperman moment theory for calculating (19.1) and (19.2) in 
case a < +oo. For a = +00, we deduce the solutions from these derived for 
finite supports with a — +00. We first note that the problem is well posed iff 
m = (m,m,) € M(a,r) = convF(a,r) with F(a,r) = {(t,t") : 0<t < a}, 
icc. m lies between the straight line m, = a’~'m, and the curve m, = m}, for 
0 < m, <a. The bounds in question are determined by the convex hull of the 
graph G(A,a,r) of three-valued function [0,a] 3 t + (¢,t", Alt] + (1 — A)[t]) for 
fixed A,a and r, which will be further ignored in notation for shortness. We easily 
see that for n denoting the largest integer smaller than a, 


n 


G= {Get ,k+1-d): k<t<k+1fU LJ (kk), 


k=0 k=0 
for a € N, and 
n-1 n 
G= {Gt k+1—-d\): k<t<k4+1fU {kk} 
k=0 k=0 


U{(t, tn +t1—A): n<t<a}, 


for a ¢ N. Since gy is not semicontinuous for 0 < A < 1, we could use the 
Kemperman method once we consider the closure G that arises by affixing points 
Uy = (k,k", k4+1—-—A),k =0,...,n, and Ly = (kik, k—A), k =1,..., [a]. Visually, 
G consists of subsequent closed pieces of the planar curve t +> (t,t”) with integer 
ends at k and k+1, located horizontally at the respective levels k + 1 — A, denoted 
further by Ue bpans and separate points Cy, = (cz, k) = (k,k",k) € L,Ug, standing 
for the vertical line segment that joins Ly, with U,. On the left, we start from the 


Moment Theory on Random Rounding Rules Using Two Moment Conditions 251 


origin Co and pies. and at the right end we have either Ut. and C, fora € N 
and the trimmed curve U,,A with A = (a,n+1—A) = (a,a",n+1-— 4) otherwise. 
We adhere here to the convention introduced in Rychlik [313] of writing points of 
two- and three-dimensional real space in bold small and capital letters, respectively. 
Observe that the notation of a = cg and A = L, is duplicated for natural a. In the 
sequel, we shall use a and A then. 

In Section 19.2 we present analytic expressions for (19.1) and (19.2) and de- 
scribe the distributions that attain the respective extremes. E.g., writing U(m) = 
D(ti,...,tk), we indicate that {t1,...,t,} C [0, a] is the greatest possible support of 
X with extreme Eg,(X). The distribution of such an X is unique iff k < 3, and then 
the probabilities of all t, are easily obtainable. However, for some m € M we obtain 
even infinite support sets, and we drop the formal description of all respective distri- 
butions for brevity. If a bound is unattainable, we furnish some t; with the plus (or 
minus) sign which would mean that the bound is approached by sequences of distri- 
butions with support points tending to the respective arguments of D from the right 
(or left, respectively). The formal presentation of the results will be supplemented 
by a geometric interpretation, i.e. the description of the upper and lower envelopes 
M ={M=(m,U(m)): m€ M} and M={M= (m,L(m)): m € M}, respec- 
tively, of the convex hull generated by G = G(\,a,r). In fact, our proofs consist 
in solving the dual geometric problems. These will be contained in Section 19.3, 
together with some purely geometric lemmas whose statements will be repeatedly 
applied in our reasoning. 


19.2 Results 


There is a magnitude of different formulas for (19.1) and (19.2) valid in various 
domains of parameters. For the sake of clearness, we decided to present the upper 
and lower bound for r > 1 and 0 < r < 1 in separate theorems. In each case, we 
further choose specific levels of the support range condition a, and finally, for given 
rand a, we analyze the moment conditions for different regions in M(a,r). Since U 
and L are continuous in m, various formulas provide identical values on the borders 
of neighboring regions. Below we apply the convention of presenting the formulas 
on the closed parts of M. 
It will be further useful to define t;, = t,(m,r) 4 k by the equation 


(my, — k”) (te — k) = (mi — k(t, — &") (19.3) 


for some m € M(a,r) and k € [0,a]. Note that ty = (ty, t,) is the intersection of 
the straight line running through (k, k") and m, and the curve t + (t,t”). It is easy 
to confirm that t, is a uniquely determined point in [0, a]. 


Theorem 19.1. (Upper bounds in case r > 1). 


(a) Suppose that N Za < +00. 
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(i) If for some k =0,...,n—1, 


kom <k+1, 


mi <m, < [(k +1)" —k (my —k) +k’, 


then 
U(m) = Dltepak+ 1+) = b+ 2-4 PEL, 
(ii) If 
nom <a, 
my Sm, < Soe) + nr, 
then 


U(m) = D((n,a]) =n+1-X. 
(iti) If for some k =0,...,n—1, 


k < my < k+ 1, 
[((k +1)" —k™](my —k) +k” <m, < nm, 


then 
U(m) = D(04+,...,n+) =m, +1-A. 
(iv) If 


O0<m, <a, 


max{n"~!mj, (aten")Gmi=n) + n"} < Mr < a’—!m, 


then 
U(m) = D(0+,n+, a) 
=1-\4 Sete 
(b) Let NS a<+oe. 
(i) If (19.6) holds, then 
U(m) = D(ta,a) =a— eae 
(it) If (19.10) holds, then 
U(m) = D(0+,n+, a) 


(1—A)(n™~'mi—m,) 


=m+1-A++S =. 


d 


(19.4) 


(19.5) 


(19.6) 


(19.7) 


(19.8) 


(19.9) 


(19.10) 


(19.11) 


(19.12) 


(19.13) 


Otherwise the statements of (ai), (atti) remain valid for the integer a as well. 


(c) Leta=+o0. 
(i) Condition (19.4) for some k € NU {0} implies (19.5). 
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(ii) If fork ENU {0} 


kom <k+1, 


my > [(kK+ 1)" —k"](m, —k) +k’, (19.14) 


then 


U(m) = D(0+,14,...)=mi+1—-d. (19.15) 


Remark 19.2 (Special cases). If a < 1, then (19.6) is satisfied by all m € M and 
so we find single formulas (19.7) and (19.12) for a < 1 and a = 1, respectively. 
For 1 < a < 2, considering (19.8) is superfluous, because it has no interior points. 
However, as a increases, the upper constraint for m, in (19.8) tends to +oo for all 
my so that in the limiting case we obtain two types of conditions only. 


Remark 19.3 (Geometric interpretation). Suppose first that a is finite and non- 
integer. If a > 2, none of conditions can be dropped, and the constraints space M 
is divided into n + 3 pieces. Condition (19.4) states that m € conuc,¢é,+1 for some 
k = 0,...,n —1, whereas (19.5) implies that the respective part of the upper en- 
velope of convG coincides with the curved part of the surface of the cone with the 
horizontal base convU;Le+1 and vertex Uz41. The part is spread among Uz) and 
Uli: and will be written co(Uz41, Ugliea) for short (case (ai)). If m € convé;a 
(see (19.6)), the respective piece of M is convC,,A, i.e. loosely speaking, convénda 
lifted to n+1—) level (case (aii)). The convex polygon convcg .. . Cn, defined in (19.8) 
corresponds to a fragment of plane slanted to one spanned by {(m,0): m © M} 
and crossing it along the line m; = A —1 at the angle 7/4 (case (aiiz)). Formula 
(19.11) asserts that is a point of the plane generated by Uo, Un, and A (plUpU, A, 
for short). Since (19.10) describes the triangle Acoc,a with vertices co, cy and a, it 
follows that the respective part of M is identical with AUoU,,A (case (aiv)). 

The case of natural a differs from the above in replacing A by C, which is situated 
at the higher level than A. In consequence, for convé,a we obtain co(Ca, U7, A) (see 
(19.12)) instead of the flat surface convC,,A as in (19.7). Also, form € Acocna, we 
get AUpUnCa, which is the consequence of (19.10) and (19.13). Note that the line 
segment U,,C, is sloping in contrast with the horizontal U,A. For the remaining 
n+ 1 elements of the partition of M, the shape of the upper envelope is identical 
for integer and noninteger a. The shape of M for a = +00 can be easily concluded 
from the above considerations. 


Theorem 19.4 (Lower bounds in case r > 1). 
(a) If0<a<1, then for allm € M 
L(m) = D(0,to) = (1 — A)my/to. (19.16) 


(b) Suppose that 1 <a< +oo. 
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(i) Conditions 
0 < my < 1, 
my <M, SM, 
imply (19.16). 

(ii) If for some k =1,...,n—1, (19.4) holds then 
tr — My 
th—k ° 
(itt) Under the condition (19.6), we have (19.18) with k =n. 
(iv) If fork =1,...,n—1, 


L(m) = D(k-,t,) =k-- 


kim <k+1, 


+ = my, —k) + < mr < (n® Dm -1) + 
[(k +1)" — k"]( k) + kr (n™—-1)(m1—-1) i 


n—-1 


then 


(v’) If 
0 < my S n, 
1; n—1 S™Mrp S71 m1, 
then 
L(m) = D(0,1-,n-) 
—m+ yee Dimer (nm : 
(vi’) If 
0 < my < a, 
max{n"—!m1, (arn man) +n} < mM, < a™ tm, 
then 


L(m) = D(0,n—,a—) = (1— 4) my 
eS [(n—A)(a—n)—n](n"~!mi—m,) ; 


If the inequality in (19.21) is reversed, then the following holds. 
(v”) For 


0 S my < a, 
max{m}, (ort) +1} <m, <a™!m 


b] 
we have 


L(m) = D(0, 1-, a—) 
— GAjlatmi-amr)+(nt1—A)(mr—m1) 


(19.17) 


(19.18) 


(19.19) 


(19.20) 


(19.21) 


(19.22) 


(19.23) 


(19.24) 


(19.25) 


(19.26) 


(19.27) 
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(vi”) For 
l<m, <a, 
max{ 2 —Vem—1) 4 1 Gen mma) 4 _r} (19.28) 
<m, < SD) +1, 
yields 
or i (oe Sa it ule nai 1) (19.29) 


(a"=1)(n—1=(n"=1)(a—1) 


(c) Let a = +00. 


(i) Condition (19.4) implies (19.16). 
(ii) If (19.4) holds for some k EN, then we get (19.18). 


(iti) If 
ens. (19.30) 
Mp 2™1, 
then 
L(m) = D(0,1—,+00—) = (1— A)m1. (19.31) 
(iv) If some k EN the condition (19.14) is satisfied, then 
L(m) = D(1-,2-,...) =m —». (19.32) 


Remark 19.5 (Special cases). It suffices to consider the case (biv) for a > 3. If 
1 <a< 2, neither of m € M satisfies the respective condition. If 2 < a < 8, 
(19.19) determines a line segment that can be absorbed by neighboring elements 
of partition. For 1 < a < 2, we can further reduce the number of subcases. First 
note that (bit) can be dropped. Also, both the (19.22) and (19.28) describe linear 
relations of m1 with m, and so (bv’) and (but) can be omitted. Furthermore,both 
the (19.24) and (19.26) are equivalent, and result in the bound 

ne 7 ou Das (2—d)(mi—m-) (19.33) 


a"™—a 


identical with (19.25) and (19.27). Hence, for 1 < a < 2, it suffices to treat three 
possibilities (bz), (béiz), and (19.26) with (19.33). Observe that there is no need to 
examine the integer and fractional a separately, because in the former case, unlike 
to the supremum problem, we can always replace a € N by a— which does not 
violate moment conditions and decreases the rounding. In fact, we can take the 
minus away from a in describing the support of extreme distribution in (19.25), 
(19.27), (19.29) and (19.33) unless a € N. 


Remark 19.6 (Geometric interpretation). If a < 1, then M consists of the line 
segments CoTo, with to(m) varying between 0 and 1, which constitute the curved 


—— —. 


conical surface co(Co,UpA) of the cone with the vertex Co and base convUoA. 
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Putting a = 1, we obtain the case (bi). We have analogous solutions in the cases 
(bit) and (biti), referring to m € convégcegsi, k = 1,...,n—1, and m € convena, 
respectively. In these domains, M coincides with n — 1 surfaces co(Ly, Uplicni 
k =1,...,n—1, and co(Ln, UA), respectively. The condition (19.19) determines 
the polygon with n vertices c1,...,Cn, and L(m) is the equation of a plane parallel 
to one including a part of the upper envelope (cf (19.9) and (19.15)). Note that one 
lies the unit away from the other and all the moment points are situated between 
them (cf Remark 19.7 below). This implies that the maximal range of the expected 
rounding with given moments is 1 and this is attained iff the conditions of (biv) (or 
(civ)) hold. 

For finite a, it remains to discuss m of the tetragon conucoc,cna. Inequality 
(19.21) states that A is located either in or above the plane plCoLiL,. If it is 
so, then for m € Acocicn (see (19.22)) M belongs to plCoLi Ly (cf (19.23)), and, 
in consequence, to ACpLiL,. Likewise, formulas (19.24) and (19.25) assert that 
M = (m,L(m)) € ACoL,A when m € Acocya. If A lies on the other side of the 
plane, we divide convcocicna along the other diagonal ¢7@. Similar arguments lead 
us to the conclusion that m € Acocja and m € Acicna imply that M € ACoL,A 
and M € AL,L,,A, respectively. 

We leave it to the reader to deduce from Theorem 19.4 that for a = +oo, M 
is partitioned into the sequence co(Co, Ucn) (cf (ct)), co(Lr, Uy), k EN, (ef 
(cii)) and two planar surfaces (see (citi) and (civ)). 


Remark 19.7. Note that the bounds (19.9) and (19.20) are the worst ones, and 
follow easily from 

X-—rA<[X]-A=A[X]-1+(1-A)[X] 
(1—A)[X] 
(1—-A)\([X]+1)<X+4+1- 2. 


Conclusions for the case r < 1 are similar and will not be written down in detail 
for the sake of brevity. Instead, we merely modify Theorems 19.1 and 19.4. 


— 
i= 


Theorem 19.8 (Upper bounds in case 0 < r < 1). Reverse the inequalities for 
my, in (19.4), (19.6), (19.8), (19.10), and (19.14). Replace max by min in (19.10), 
and suplement (19.14) with m, > 0. Under the above modifications of conditions, 
the respective statements of Theorem 19.1 hold. 


Theorem 19.9 (Lower bounds in case 0 <r <1). Reverse the inequalities for m, 
in (19.4), (19.6), (19.17), (19.19), (19.22), (19.24), (19.26), (19.28) and (19.14), 
adding m, > 1 to the last one. Substitute min for max in (19.22), (19.24), (19.26) 
and (19.28). Then the respective conclusions of Theorem 19.4 hold. If, moreover, 
we change the roles of my and m, in (19.30), then 
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19.3. Proofs 


Lemma 19.10 can have wider applications in the geometric moment theory than for 
the two moment condition problems. At the sacrifice of meaningfulness of three- 
dimensional geometry arguments, we therefore decided to formulate the result and 
employ arguments valid for the spaces of any finite dimension. 


Lemma 19.10. Let M’ Cc M be open and not overlapping with F, and 0M' 
stand for its border. Then each M = (m,U(m)) € M (M = (m,L(m)) € M) 
form € M' can be represented as a convex combination of M; = (m;,U(mi)) 
(M, = (m;)),U(m;) with m; € OM’. Accordingly, the subset of G generating the 
part of the upper (lower) envelope form € M’ coincides with that form € 0M’. 


Proof. We only examine the upper envelope case. The other can be handled in 
the same manner. Take an arbitrary M = (m,u) such that m € M’ and has a 
representation 


for some positive a@; amounting to 1. The condition M’MF = @ implies that 
none of m; € M’. We can partition all M; into three, possibly empty, groups: 
the M; = (m;,U(m,)) with m; € OM’, the M; = (m;,ui) with m; € OM’ and 
u; < U(m;), and the M; with respective m; ¢ 0M’. Our aim is to modify (19.35) 
so to preserve the moment conditions, despite of replacing some m,; by border 
points, and not to decrease the criterion functional. To this end, it is sufficient to 
replace the elements of the second and third groups. In the former case, we simply 
substitute M; = (m;,U(m;)) for each Mj. 

Less evident arguments are used for eliminating M; with m; ¢ OM’. To be 
specific, suppose that so is M1, and we rewrite (19.35) as 


ef} 


M =a,M,+(1—a1) >> 

iAl 

Note that m and m, lie on the other sides of 0M’. Let mi, = Gm, + (1 — 8)m, 

0 < B <1, be a crossing point of Mm with OM’, and M] = 6M, +4 (1 —- B)M. 
Then m € m{,m_, as well, and likewise 


M; —- a, My, + (1 = a1) M_1. 
l-a, 


= a BU-ai) 
M = aypamayMi + ataa—anyM-1 
= 7M; + (1 -7)M-1. 


(19.36) 
Substituting 7, = (m’,,U(m\)) € convG for M! yields 
M! = 7M, + (1—7)M-1, 


or equivalently 


(m,u’) = (mj, U(m4)) + (1 — y)(m-1, u-1). 
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This has two desired consequences: we replace M, with m, ¢ 0M’ by M, with 
m‘, € OM’, and derive u’ > u. Repeating the procedure finitely many times, we 
eliminate all M; from the third group, improving the approximation of M at every 


step. 


Lemmas 19.11, 19.12 and 19.13 follow from Lemma 19.10 and will be applied 
directly in the proofs of our original problems. 


Lemma 19.11. Assume that MjM; CM (M), i,j =1,2,3. Then AM, M2M3 c 
M (M) as well. 


Proof. Project AM, M2 Mz onto the space of moment conditions. It is evident that 
the respective projection Am;m2mz3 has no interior points common with F. Since 
each of the edge points of AM; M2Ms3 is a convex combination of the vertices, and 
belongs to M (M), then, by Lemma 19.10, for every M (M) with m € Amym2m3 


we have 
3 


M (M) = S¢aiM; € M(M) 

i=l 
for some convex combination coefficients a;, i = 1, 2,3. All the combinations com- 
pose AM, M2 M3. 


Lemma 19.12. Suppose that all the edges M;Mj41, i = 1,2,3, and M4M, of 
the tetragon conuM,M2M3My, are contained in M. Then we have AM,M2M3 U 
AM, M3M, Cc M if M4 does not lie beneath p | M,M2M3, and AM,M2M, U 
AM2M3M, Cc M if M4 does not lie above p | M,M2M3. 


Proof. In the standard notation, m; stands for M; deprived of the last coor- 
dinate. Analysis similar to that in the proof of Lemma 9.11 leads to the con- 
clusion that (m,LZ(m)) is a convex combination of M;, i = 1,2,3,4, for every 
Mm € Convm1M2M3M4. 

By the Richter-Rogosinsky theorem, one of M; is here redundant for a given 
m. Let mp and Mo denote the crossing points of the diagonals of the respective 
tetragons. Consider, e.g., the case m € Am ym2mo. The the respective M can 
be written as a combination of either of the triples M1, M2, M3 and My, M2, M4. 
The former occurs if M4 is located over plM,M2Ms3, i.e. AM,M2Msz3 lies beneath 
AM, M2M4. The latter is true in the opposite case. Noting that the former con- 
dition is equivalent with locating M2 over plM,M3M4g, we similarly analyze three 
other triangular domains of moment conditions and conclude our claim. In partic- 
ular, if M;, i =1,2,3,4, span a plane, then convuM,M2M3M, € M. 


It is easy to obtain a version of Lemma 19.12 for M;M; C M. This was not 
formulated here, because the result will be used for determining the lower bounds 
only. 


Lemma 19.13.(i) Consider Ua Tas CG and View = (k+1,(k+1)",v) for some 
v>k+1—X. If Ue CM, then co (Vii, Up Lexi) CM. 
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(ii) Let TS Cc Ue lices NG and Vy, = (k,k",v) for somev <k+1-—. Then 
Vi CM implies co(Vp, U;,S) cM. 
The point Vi+1 defined in Lemma 19.13(7) lies just above L,41 and will be replaced 
by either Uz41 or Cg in the proof of Theorem 19.1. Also, Vy, lies under U;, and will 
be further replaced by either Co or Ly as well as S stands for either Dy41 or A. 


Proof of Lemma 19.13 (i) Since both the U,V,41 and Ue, belong to M, 
due to Lemma 19.10, every M with me CONVERCKL1 can be written as a mixture 
of at most three points from Ula U {Vii}. Combining elements of the curve 
only, we do not exceed k + 1— A, while introducing Vi41, being possible for any 
me CONVERCh41, enables us to rise above the level. Assume therefore that M = 
(m,u) € AM1MoVp41 for My, Mz € UjLeai, or equivalently M € M3Veqi for 
some Ms = (m3,k+1—A) € M1 Mg. Note that the line running through m and m3 
crosses F at cy41 and ty41 = (te41,th41) € €eCeti (see (19.3)). Consider Ti.41Ve41 
for Th41 = (te41,k+1— 4). It is easy to see that Ty41V,41 lies above MsV,.41 and, 
consequently, it crosses the vertical line t +> (m,t) at a level u’ > u, say. In fact, the 
construction is unique and so it determines M. Accordingly, for all m € conucrpépii 
the respective M € TyiiVea1 C M, where T,41 runs along the whole Clg as 
m varies. The Ty41V¢41 compose co(Vp41, Cele 
(it) Analogous. 


Proof of Theorem 19.1 (a) It is obvious that the (possibly limiting) distribution 
of X that attains the upper bound U(m,) = sup £g)(X) under the single moment 
condition EX = mj, and satisfies EX" = m,, provides also the respective bound 
U(m1,m,) for the two moment problem. Therefore we directly apply the solution 
presented in Anastassiou and Rychlik [89]. It follows that U(m,1) = U(m,m,) = 
m, +1-—A for (mi,m,) € convcgo...¢, and U(m,) = U(m1,m,) = n+1-— A for 
(m1,M,) € convé;a which solve (aizi) and (aii), respectively. The latter can also 
be derived by noting that n+ 1 — A is the greatest possible value of g,(X) attained 
when n < X <a. 

Assume now that m € conv ¢xCki1 for some k = 0,...,n — 1. Noticing that 
U;,.Un41 © M and applying Lemma 19.13(i) with Vii1 = Un41, we conclude that the 
respective part of the upper envelope is co(Ux41, Uelicei) (case (ai)). It remains 
to consider m € Acocna. Observe that UjpA Cc M, because M € UpA uniquely 
determine limiting distributions supported on {0+,a}, and UpU;,,U,A Cc M (cf 
(aiii) and (aii)). By Lemma 19.11, AUpU; A C M (case (aiv)). 

(b) From the solution of the single moment problem we derive some parts of M. 
These are convUg...Un, UnCa, and clearly all the Uli cae, and UopC,. The first 
one provides the solution for m € conuco...€n, identical with that for noninteger 
a, and allows us to repeat the arguments and conclusions of the proof of (az) for 
m € Cea, kK =0,...,n —1. In the similar way we use Lemma 19.13 and U,Cg C 
M to deduce that M € co(Cu, U,A) if m € convucya (case (bi)). Finally, optimality 
of UpU,, UnCg and UpC, and Lemma 19.11 enable us to complete M by adding 
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AUoUnCa (case (bit)). 

(c) Let a + +00. Then extending the range of X does not affect the solution 
for m € convéxcg 41 for a fixed integer k > 0 (case (ci)). On the other hand, the 
polygons convey... Cn gradually appropriate the whole area above the broken line 


Unio teeeq1 (case (cit). 


Proof of Theorem 19.4. (a) The assertion follows from the direct application of 
Lemma 19.13 (#1) with Co and A standing for V;, and S, respectively. 

(b) A partial solution is inherited from the one moment problem. Its geo- 
metric interpretation describes the following parts of the lower envelope: Col, 
convL,...L, and L,A. Combining Col; Cc M with Lemma 19.13(7i) implies 
co(Co, UoL1) Cc M which is equivalent with (bi). The polygon with the vertices 
[y,..., Ly describes the solution of case (biv). Also, this allows us to deduce from 
Lemma 19.13 that co(Lr, Cpl ea) Cc M for k =1,...,n—1, (see (bit)). Yet another 
application of Lemma 19.13 with Vi. = L,,A yields co(Ln, U,,A) CM (see (biit)). 

Still remains the task of determining the lower bounds for m € conucgc,cya. 
Knowing that OconvCoL,L,A Cc M, we can employ the assertions of Lemma 19.12. 
If A is situated either on or above plCo Li Ln, ie., (19.21) holds, then ACo LiL, and 
ACoL,A are parts of M which correspond to the solutions in cases (bv’) and (bvi’), 
respectively. Otherwise, we have ACoL; AU AL,L,A C M (see (bv) and (bvi”)). 

(c) Including arbitrarily large points to the possible support does not change the 
solution for m € convéxcpy1 and a given k € NU{0}, which is identical with that for 
any finite a > k+1. Since (7G, tends to the vertical halfline starting upwards from 
c,; in the plane of M, the domain of the trivial lower estimate m, — A ultimately 


covers the region above U7__., GCkq1- 

The only point remaining is establishing the shape of M for m situated above 
coc. Take any finite-valued distribution satisfying the moment conditions and rep- 
resent it as a mixture of ones supported on [0,1] and (1, +00), respectively. The rel- 
evant pairs of the first and rth moments satisfy m1 € conuégc, and mz € CONVE| Coo, 
and m € 7m. The respective Mj, Mo runs above Cpl and the horizontal halfline 
tr (1,t,1—A), t > 1, both belonging to M. It can be replaced by Mj Mé that joins 
CoL; and the halfline, and is located below M,Mz. The family of all such M {Ms 
determines the inclined band of M = (mi,m,, £(m1,m,)) described by (19.30) and 
(19.31). 


Remark 19.14. (Proofs of Theorems 19.8 and 19.9). The analogy between the 
cases r > 1 and r < 1 becomes apparent once we realize that both the cases have 
the same geometric interpretations (cf Remarks 19.3 and 19.6). The only exception 
is Proposition 19.4 (ciiz) and its counterpart (19.34) in Theorem 19.9. Since we use 
only geometric arguments in the above proofs, there is no need to prove Theorem 
19.8 and 19.9 separately. The reader is rather advised to follow again the proofs 
of Theorem 19.1 and 19.4 as if r < 1 were supposed. Also, one can treat the 
above mentioned difference slightly modifying the last paragraph in the proof of 
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Theorem 19.4. For r < 1, we consider m lying right to Co¢7, and replace the half 
line t+ (1,t,1 — A) by t% (¢,1,t—A) for t > 1. The latter, together with Col; 
span the plane described by the equation (19.34). 
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Chapter 20 


Prokhorov Radius Around Zero Using 
Three Moment Constraints 


In this chapter we find exactly the Prokhorov radius of the family of distributions 
surrounding the Dirac measure at zero whose first, second and fourth moments are 
bounded by given numbers. This provides the precise relation between the rates of 
weak convergence to zero and the rate of vanishing of the respective moments. This 
treatment relies on [88]. 


20.1 Introduction and Main Result 


We start by mentioning the concept of Prokhorov distance of two probability mea- 
sures jJ,V’, which is generally defined on a Polish space with a metric d. This is 
given by 
m(p, Vv) =inf{r >0: w(A) < v(A") +7, v(A) < WA") +7, 
for every closed subset A}, 

where AT = {x: d(a,A) <r}. Note that in the case of standard real space with 
the Euclidean metric, the Prokhorov distance of a probability measure p to the 
degenerate one do concentrated at 0 can be written as 


w(t, 00) =inf{r >0: wl) >1—r}. (20.1) 


Here and later on I, = [—r,7r], and I¢ stands for its complement. 
For a given triple of positive reals € = (€1, €2,€4), we consider the family M(€) 
of probability measures on the real line such that 


M(E) = {u: | fe ay <j, i=1,2,4}. (20.2) 
Theorem 20.1 gives us the precise evaluation of the Prokhorov radius 


D(E)= sup 7(u,d0) 
bEM(E) 


for family of measures (20.2). 
Theorem 20.1. We have 
D(e1, €2, €4) = min{es!®, ef}. (20.3) 
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This is a refinement of a result in Anastassiou (1992),[19] where the Prokhorov 
radius D(€1,€2) = ef * of the family with constraints on two first moments was 
established. The problems of determining the Levy and Kantorovich radii under 
two moment conditions were considered in Anastassiou (1987),[18], and Anastassiou 
and Rachev (1992),[83],respectively. Anastassiou and Rychlik (1999),[86], studied 
the Prokhorov radius of measures supported on the positive halfaxis which satisfy 
conditions on the first three moments. Since the Prokhorov metric induces the 
topology of weak convergence, formula (20.3) describes the exact rate of weak con- 
vergence of measures from M(E€) satisfying the three moment constraints to the 
Dirac one at zero. 

Though our question is stated in an abstract way, it stems straightforwardly 
from applied probability problems in which rates of convergence of random vari- 
ables to deterministic ones are evaluated. If we study how fast the random error 
of a consistent statistical estimate vanishes, then zero is the most natural limit- 
ing point. Convergence in probability is implied by that of the first two moments. 
Adding the fourth one, which has a meaningful interpretation in statistics, allows 
us to find refined evaluations. These three moments have natural estimates, and 
so one can easily control their variability. Moreover, the respective power functions 
form a Tchebycheff system. Convergence of integrals for elements of such systems 
implies and provides estimates for integrals of general continuous functions. The 
latter convergence is described by the weak topology, and the presented solution 
gives a quantitative estimate of uniform weak convergence (expressed in terms of 
equivalent Prokhorov metric topology) for a large natural class of measures deter- 
mined by moment conditions. 

Formula (20.31) is determined by means of a geometric moment theoretical 
method of Kemperman (1968),[229], that will be used in Section 20.2 for calculating 


L,(M)= inf I, 20.4 
(M)= int, nll.) (20.4) 


with given M = (m1, me2,ma4) and 


M(M) = {ui f thdu=ms, i=1,2,4} 


for all possible m ,,m2,m4, and r > 0. In Section 20.3 we present the main result 
of the chapter; having determined (20.4) for various M, we first evaluate respective 
infima over the boxes in the moment space 


L,(E) = inf{L,(M) : |mi| < 6, 1 = 1,2, 4} (20.5) 
for every fixed r, and then, letting r vary, we determine 
D(E) =inf{r>0: L,(€) >1-r}. (20.6) 


In the short last section we sketch possible directions for a further research. 
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20.2 Auxiliary Moment Problem 


Fixing r > 0, we now confine ourselves on solving moment problem (20.4). This is 
well stated iff 


M €W= {(m1,m2,m4): m1 € R, mg > m7i,m4 > m3}. 


Note that W = convT = conv{T = (t,t?, t*) : t © R}, the convex hull of the graph 
of function R 5 t + (t,t?,t*). Geometrically, W is a set unbounded above whose 
bottom W is a membrane spanned by J. The membrane can be represented as 
W=Us,s)7_-T+, where T_ = (-t,t?,t*), T, = (t,t, t*), and AB denotes the line 
segment with end-points A and B. The side surface consists of vertical halflines T™ 
running upwards from the points T € J. Consider the following surfaces in W: 


AOR,R_ — the triangle with vertices 0 = (0,0,0), Ry = (r,r?,r*) and 
R= (-7,r7,7°), 

mem(R,,0R,) = Waste gE ie, end mem(R_,OR_) = U_,cye9 FR- the mem- 
branes connecting R, and R_ with the points of the curves OR4 = Vie te 
0<t<r}and OR_ = {(-t,t?,t4) : 0<t <r}, respectively, 

R_R, |, OR, , and OR_' — the infinite bands above the line segments R-Ry, 
OR, and OR_, respectively. 


They partition W into five closed subsets with nonoverlapping interiors: 


W — the set of points situated on and above AOR, R_, 

W, — the moment points on and above mem(R4, OR), 

W 3 — the points on and above mem(R_,OR_), 

W4 — the points between AOR,R_, mem(R4, OR), mem(R_, OR_), and W!r = 
Uo<r<, LT, the last surface being a part of the bottom of the moment space, 


W; — the moment points lying on and above W! = Uris, fT. 


The solution to (20.4) is expressed by different formulae for the elements of the 
above partition. 


Theorem 20.2. The solution to (20.4) is described by 

1 = m/r*, if MEW, 

(r — |mi|)?/(r? — 2|malr + ma), if M € We U Ws, 
(r? — m2)?/(r4* — 2mer? + m4), if M € Wa, 

0, if Me Ws. 


L,(M) = (20.7) 


One can easily confirm that the formulae for neighboring regions coincide on their 
common borders. In particular, this implies continuity of L,.. 


Proof of Theorem 20.2. First notice that Ws is the closure of the convex hull of 
T (1°) = {(t,t?, t*) : |t| > r}. The inner elements of W; are the moment points for 
measures supported on [° and therefore L,(M) = 0 for all M € Ws. 
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The other formulae in (20.7) will be determined by means of the optimal ratio 
method due to Kemperman (1968),[229], that allows us to find sharp lower and 
upper bounds for probability measures of a given set (here: the lower one for those 
of I,) under the conditions that the integrals of some given functions with respect 
to the measures take on prescribed values (here: f t' dw = mj, i = 1,2,4). The 
method can be used under mild assumptions about the structure of probability 
space and functions appearing in the moment conditions (cf Kemperman (1968, 
Section 5),[229]). These are satisfied in the case we consider and therefore we 
merely present a version adapted to our problem instead of the general description. 
Given a boundary point W of W, W ¢ Ws,we use a hyperplane H supporting W 
at W, and another one H’ supporting Ws that is the closest one parallel to H. Then 
for every moment point M in the closure of conv(W NH) U (W; NH’), we have 


d(M,H’) 

L,(M) = WH) ' (20.8) 
where the numerator and denominator in (20.8) denote the distances from the 
moment point M and hyperplane H to H’, respectively. 

We shall therefore take into account the hyperplanes H supporting points W € 
Unter T! UW". First consider the vertical plane H : m2 = 0 that supports W 
at all points of 0'. Then H’ : mz — r? = 0 is the closest and parallel to H plane 
that supports W;. Since HN W = Ol, and H’ NW; = Pak. then for every 
M € conv0! UR_R,| = Wi, we apply (20.8) to find L,(M) = 1—mg/r?. 

Consider now a side hyperplane H such that 2M W = T! for some 0 < t <r, 
described by the formula H : mz — 2tm, +t? = 0. We can easily observe that H’ 
is the plane parallel to H that supports Hs along Bi This can be written as 
H' : mz — 2tm, + 2tr —r? = 0. Applying the standard formula 


ee 7 1/2 
Sani (S08) 
i=1 i=1 
measuring the Euclidean distance between a point Y = (y1,...,Yn) and a hyper- 
plane A: S>\"_, a;x; + b =0 in R”, we obtain 
d(M,H') = |m2 — 2myt + 2tr — r?|/(1 + 4t?)¥/?, 
d(H, H’) = d(H, Ry) = (r — 4)? /(1 + 407), 


d(Y, A) = 


and, in consequence, 


|m2 — 2mit + 2tr — r?| 


for all M ¢€ convT | U Ri = 7 ee Representing M as a point of the plane 
containing Tone 3 we get m2 = (r+t)(m; —r)+r?, which enables us to express t 
in terms of m, and mz as 

my zr — mg 


t= 
r—- my 
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This substituted into (20.9) yields 
(r == m4)? 
= aa (20.10) 

Note that this is valid for all M € W2 = Upe;e> Toh 

The respective formula for M € W3 is obtained by replacing m, by —m, in 
(20.10). This is justified by the fact that the arguments of the optimal ratio method 
are purely geometric, and both W and Ws; are symmetric about the plane m, = 0. 

Consider now a plane H that touches the bottom side of W along T_T, for 
some 0 <t <r. This is defined by the formula H : m4 — 2t?m2 4+ t* = 0. Then 
H' : ma — 2t?m2 — r* + 2t?r? = 0 is the closest parallel hyperplane to H that 
supports Ws; along R_R,. Arguments similar to those applied in the analysis of 
the side hyperplanes produce 


d(M,H’) = |ma — 2mgt? + 2t?r? — r4|/(1 + 4t4)1/?, (20.11) 
d(H, H’) = (r? — #7)? /(1 4+ 4041/2, (20.12) 
where 
2 
2 mgr” — m4 
? = ——— (20.13) 


re — mg 

is determined from the equation m4 = (r? + t?)(m2 — r?) + r4, defining the plane 
that contains both T_T, and R_R,. Dividing (20.11) by (20.12) and substituting 
(20.13) for t? gives the penultimate formula in (20.7). Observe that this is valid for 
the moment points of the trapezoids convT_T, UR_R,, 0 < t < r, whose union 


forms W4. This ends the proof of Theorem 20.2. 


20.3. Proof of Theorem 20.1. 


We first confirm that fixing m2 and m4 we minimize L,.(m1,m2,m4) at m, = 0. 
Note that L,(M) for M € W, UW4U Ws does not depend on the value of m1, and 
(m1, Mm2,ma) € W; implies (0,m2,ma4) € Wi, i = 1,4,5. Differentiating the second 
formula of (20.7) with respect to |m1|, we derive 
OL,(M) _ 2(r = |mil)(|mailr — ma) 
Olmy| (vr? — 2] my |r + m2)? 

which is nonnegative for M € W2U W3, because |m | < r and mg < |mj|r there. 
Therefore we decrease L,(M) moving M € W2U W3 perpendicularly towards the 
plane m, = 0 until we reach the border. Then we can move further entering either 
W, or W, that would not result in change of L,(M/) until we finally arrive at 
(0, m2, ma). 

Evaluating (20.5) we can therefore concentrate on the moment points from the 
rectangular 


Ro = {(0,m2,ma) : mi S ei, t= 2,4}. (20.14) 
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The points of Ro NW may generally belong to any of W1,W4.4 and W;. However, 
if some M € Ro NWs, which is possible when €2 > r? and €4 > r*, then L,(M) = 
L,(E) = 0, which is useless in determining (20.6). Otherwise the moment points 
of Ro belong to either Wy, or W4. In the former case L,. is evidently decreasing in 
mg and does not depend on mz, (20.7). In the latter, L, is decreasing in m4, and 
increasing in mz, because mz < r?, m4 < mgr”, and so 
OL,(M) _ 2(r? — m2)(mer? — ma) Sai 
Ome (r4 —2mer2+m4)? ~ 
for M € Ro NW4. 
We now claim that L, is minimized on (14) at E = (0,¢,er?) with « = 
min{€2, €4/r?} so that 
L(6) SLi Eyal ee, (20.15) 
The last equation follows from the fact that E € W,. We prove the former using 
the following arguments. First see that if «4 > egr?, we can exclude from con- 
siderations all points situated above Ej)H for Ey = (0,0,er”). Indeed, any point 
M = (0,m2,ma) of this area can be replaced by M’ = (0,mz, er?) € EoE so that 
L,.(M’') = L,(M). Then we exclude all points of Ro below OF, which belong to W4. 
Keeping mz, fixed and decreasing m2 until we reach OF, we actually decrease L,.. 
What still remains to analyze is AOE). Without increase of L, we vertically raise 
all M € ADEE to the level Eo, and finally move them right to & which results 
in decreasing L,. 
Now we are only left with the task of determining (20.6) which, by (20.15), 
consists in solving the equation 1 — €?/r? = 1 — r, or, equivalently, 
min{r7e,e4} =r. (20.16) 
If er < ae then the graphs of both sides of (20.16) cross each other at level €4, and 
the solution is ey! >. Otherwise they meet below e4 for r = es! 3. These conclusions 
establish the assertion of Theorem 20.1. 


20.4 Concluding Remarks 


A natural extension of the above problem consists in analyzing distributions tending 
to points different from zero. However, by reference to Anastassiou and Rychlik 
(1999),|[86], in this case one can hardly expect obtaining final results in form of nice 
explicit formulae. Another question of interest is the Prokhorov radius described 
by other moments. Also, one can replace Tchebycheff systems of specific powers by 
elements of general families of functions, e.g. convex and symmetric ones. A next 
step of this study is determining radii of classes described by moment conditions 
in other metrices which induce the topology of weak convergence (see Anastassiou 
(1987),[18], Anastassiou and Rachev (1992),/83]). Comparing rates of convergence 
of radii of given classes of measures in various metrices would shed some new light 
on mutual relations of the metrices. 


Chapter 21 


Precise Rates of Prokhorov Convergence 
Using Three Moment 
Conditions 


In this chapter we consider families of life distributions with the first three moments 
belonging to small neighborhoods of respective powers of a positive number.For 
various shapes of the neighborhoods, we determine exact convergence rates of their 
Prokhorov radii to zero. This gives a refined evaluation of the effect of specific 
moment convergence on the weak one. This treatment relies on [90]. 


21.1 Main Result 


We start with mentioning the notion of the Prokhorov distance a of two Borel 
probability measures pu, v defined on a Polish space with a metric p: 


m(u,v) =inf{r > 0: w(A) <v(A") +7, (A) < WA") +97 


for all closed sets A}, (21.1) 


where A” = {a : p(x,A) < €}. The Prokhorov distance generates the weak 
topology of probability measures. In particular, if we consider probability measures 
on the real axis with the standard Euclidean metric and v = 6, is a Dirac measure 
concentrated at point a, then (21.1) takes on the form 


T(u, 6a) =inf{r>0: wU,) >1—r} 


with [, = [a—r,a+r]. For fixed positive a and €;, 2 = 1,2,3, we consider the fam- 
ily M(E), € = (€1, €2,€3), of all probability measures supported on [0,+co) with 
finite first, second and third moments m; = ft’ u(dt), i = 1,2,3, respectively, such 
that |m; — "| < 6, i = 1,2,3. The probability distributions concentrated on the 
positive half line are called the life distributions. The first standard moments pro- 
vide a meaningful parametric description of statistical properties of distributions: 
location, dispersion and skewness, respectively. Note that functions ¢t’, i = 1,2,3, 
form a Chebyshev system on [0,-+co). The Chebyshev systems are of considerable 
importance in the approximation theory, because they allow one to estimate inte- 
grals of arbitrary continuous functions by means of respective expectations of the 
elements of the system (see Karlin and Studden (1966) ,{223]). The objective of this 
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chapter is to establish the exact evaluation of the Prokhorov radius 


II(e) = sup m(p, da) (21.2) 
HEM(E) 

of the parametric neighborhood M(E€) of the degenerate measure J,. We are moti- 
vated by the problem of describing the rate of uniform weak convergence of measures 
with respect to that of its several moments. We therefore suppose that all €; are 
small in comparison with a. However, it is worth pointing out that our results 
are not asymptotic and we derive precise value of (21.2) for fixed nonzero e;. Pre- 
sentation of some asymptotic approximations will follow Theorem 21.1 containing 
the main result. Note that «; — 0, 7 = 1,2, implies that the respective measure 
converges weakly to 6,. Anastassiou (1992),[19], calculated the Prokhorov radius 
of the class of measures satisfying two moment bounds |m; — ‘| < e;, i= 1,2. The 
Levy and Kantorovich radii of the class were determined in Anastassiou (1987) ,[18] 
and Anastassiou and Rachev (1992),{83], respectively. In the last, the Prokhorov 
radius problem for the Chebyshev system {sint, cost} on [0,27] was also solved. 
Anastassiou and Rychlik (1997),[88], analyzed the Prokhorov radius of classes of 
measures surrounding zero satisfying first, second and fourth moments conditions. 
Below we give a refinement of the result by Anastassiou (1992),[19]. 


Theorem 21.1. [f0 < II(E) < a* = min{a, 1}, then it can be determined by means 
of one of the following algorithms: 


(i) If 
2aey +e <1, (21.3) 
€3 > 3a%e, + 3ae2 — (2ae, 4 €o)*/3e1, (21.4) 
then 
T(E) = (2ae, + €2)1/8. (21.5) 
(it) Assume that there is r = r(a,€1,€3) € (0,a*) such that 
(3a? + r7)ey + €3 = ar?(14+ 2r) — 2(1 — r)[(a? + r?/3)9/? — a}. (21.6) 
If 
ler + (1 —r)[(a? + 17/3)? — al] <r’, (21.7) 
€9 > |2ae, + 2a(1 — r)[(a? + r2/3)1/? — aj — (1 + 2r)r?/3], (21.8) 


then II(E) = r(a, é1, €3). 
(iti) Assume that there is r = r(a,€1,€3) € (0,a*) such that 


(a—ar+r?—e)? = (1—r)?[a? —r(a — 1)? +3). (21.9) 
If 

r? —(1—r)[(a? + r2/3)/? — a] <a, <r’, (21.10) 

€ > |(a—ratr’? —e)/—r) +r(a—r)? — a? |, (21.11) 
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then II(E) = r(a, €1, €3). 
(iv)Assume that there is r = r(a, €2,€3) € (0,a*) such that 


(3a? + 7? Jeo + 2aeg = 3a7r? — (1 + 2r)r*/3 — (1 —r)r®/(27a7). 
Tf 


leg + (24 r)r? /3 + (1 r)r‘ /(9a?)| < 2ar?, 
ae, > eg +r? + (1 —r)r*/(9a7), 

then II(E) = r(a, €2, €3). 

(v) Assume that there is r = r(a, €2,€3) € (0,a*) such that 


[a—r(a —r)? — &)]? = (1— ria? — r(a—7r)? + 3)’. 


Tf 
2ar? — (24 r)r?/3 — r*/(9a?) < eg < 2ar? — r°, 
€.>a-ratr? —(1—r)/?[a? — r(a— 1)? — e]'”, 


then II(E) = r(a, €2, €3). 
(vi)Assume that there is r = r(a, €1, €2,€3) € (0,a*) such that 


(1 —r)¥/?/(3a? — r?)e, + 3a€9 — €3] = (r? — 2ae, — €2)(2ae, + € — r*)1/?, 


If 
2ae, + €2 < r?, 
la? 9? + Qala? he? (3)/71e + Da + (@ 477/83) le 
—[(a? + r7/3)1/? — alr? < eg < (307 — r*)e, + 3a8€2, 


then II(E) = r(a, €1, €2, €3). 
(vii)Assume that there is r = r(a, €1, €2,€3) € (0,a*) such that 


(1 —r)/?/(3a? — r?)e, — 3ae — €3] = [r? — 2ae, + €9](2ae1 — €2 — °)1/?. 


Tf 
2ae, — r? < €2 
< [(a? + 1?/3)!? +a — rey — [(a2 + 12/3)? — aly, 
(3a? — r?/3)e, + [Ba + r?/(3a)Je2 — r4/(3a) 
< e < (a? — 17? 4 Qa(a? +77/3)/7]e — (20 + (a? +177/3)7 Je 
—[(a? + 72/3)"/? — alr?, 
then II(E) = r(a, €1, €2, €3). 


271 


(21.12) 


(21.13) 
(21.14) 


(21.15) 


(21.16) 
(21.17) 


(21.18) 


(21.19) 


(21.20) 


(21.21) 


(21.22) 


(21.23) 


We have only considered the Prokhorov radii that satisfy I(€) < min{a,1} 
which is of interest in examining the rate of convergence. The conditions for II(E) = 
1 < acan be concluded from the proof of Theorem 21.1. The case a < I(€) < 1 can 
be analyzed by means of tools used in our proof. The solution has a simpler form 
than that of Theorem 21.1, but we do not include it here. Although our formulas for 
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determining the Prokhorov radius look complicated at the first glance, they provide 
simple numerical algorithms. Case (i) has an explicit solution. In the remaining 
ones, one should determine solutions of polynomial equations (21.6), (21.9), (21.12), 
(21.15), (21.18) and (21.21) in r of degree from 6 in case (iii) to 9 in case (v) with 
fixed a and ¢€;, i = 1,2,3, and check if they satisfy respective inequality constraints. 
If the Prokhorov radius is determined by one of the polynomial equations, there are 
no other solutions to the equations in (0, a*). 

One can establish that (21.9) and (21.15) can be used in describing arbitrarily 
small Prokhorov radii which is of interest in evaluating convergence rates iff a < 
1/6 and 1/3, respectively. For 1/6 < a < 1—3¥V3/8 and 1/3 < a < 37/64, 
respectively, the formulae enable us to determine Prokhorov radii belonging to some 
right neighborhoods of a that are separated from 0. For larger a, (21.9) and (21.15) 
are useless. The remaining five cases allow us to determine H(€) in the whole 
possible range (0,a*) for arbitrary a > 0. 

In comparison with analogous two moment problems, Theorem 21.1 reveals 
abundant representations of Prokhorov radius, depending on relations among «;, 
¢=1,2,3. Formula (21.5) coincides with the Prokhorov radius of the neighborhood 
of 6, described by conditions on the first and second moments only (cf Anastassiou 
(1992),[19]). It follows that if the conditions on the third one are not very restric- 
tive (see 21.4), then they are fulfilled by the measures that determine the Prokhorov 
radius in the two moment case. One can also confirm that equations (21.6), (21.9), 
and (21.12), (21.15) determine the Prokhorov radii for classes of measures with 
other choices of two moment conditions. The former two refer to the first and third 
moments case and the latter to that of the second and third moments, respectively. 
Again, (21.8), (21.11), and (21.14), (21.17) indicate that admitting sufficiently large 
deviations of the second and first moments, respectively, we obtain the Prokhorov 
radii depending only on the remaining pairs. One can deduce from our proof that 
cases (iti) and (v) refer to smaller values of €3/e€1 than cases (ii) and (iv), respec- 
tively. Using (21.10) and (21.16) we can find some lower estimates II(€) > a 
and II(€) > (€2/2a)'/? for the Prokhorov radius determined by equations (21.9) 
and (21.16), respectively. In the last two cases the radius depends directly on all 
€;, 7 = 1,2,3, and the respective inequality constraints represent upper bounds for 
the range of each €;. We can find some explicit evaluations of II(€) there making 
use of the fact that all factors in (21.18) and (21.21) are positive. E.g., from (21.18) 
and (21.21) we conclude that 


(Qae, + €2)/? < 2 Qe), (21.24) 


A 
= 
oS 


1/2 


A 
= 
cs 


(2ae, — €2) < (2ae, — €2)1/8, (21.25) 


(cf (21.5)). 

In cases (#7) and (iv) there are no simple estimates of II(€) in terms of €; and €2 
that could be compared with (21.5), (21.24) and (21.25), because either €2 or €, can 
be arbitrarily large then. Nevertheless, assuming that the parameters €; are small 
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and so is II(E), we can solve the approximates of (21.6) and (21.12) neglecting all 
powers of r except of the lowest ones. This produces 


3a7e1 + €3 
3a7€q + 2ae3 tee 
ne)» (*S5=ES) 


respectively. Analogous approximations of (21.9), (21.15), (21.18), and (21.21) 
derive 


a? + €3 — (a—«)? 


U ee Nee 
(€) 6a2e, — 3ae7 + 2€3” 
me) x ~ete ea 
6ateg — 30763 + 4ate3 — €3” 
(2ae; + €2)3 
H(é) = L|-———_ 
(€) (3a2€, + 3a€2 — €3)?” 
2 ES 3 
ite) woe) 


(3a2€, — 3a€2 — €3)2” 


respectively. 
21.2 Outline of Proof 
The proof of Theorem 21.1 is lengthy for full and complete details see,[86] and 


Chapter 22 next. Below we only sketch the main ideas. A general idea of proof of 
Theorem 21.1 comes from Anastassiou (1992),[19]. We first determine 


L,(M) = inf{p(I,) : i: t’ u(dt) = m;, i= 1,2,3} (21.26) 
for fixed 0 <r <a and all ; 
M = (m1,m2,m3) EW={M: [ t’ u(dt) = m;, i= 1,2, 3}. (21:27) 
Thus 
IM(€) = inf{r >0: £,(€) > 1-7}, (21.28) 
where 
L,(E) = inf{L,(M): M € B(E) NW}, (21.29) 


B(E)= {M=(mi,me2,m3): |mi—a"| <j, i=1,2,3}. (21.30) 


Further rectangular parallelepipeds (21.30) will be called shortly boxes. The advan- 
tage of the method is that having solved (21.26), we can further analyze function 
L,(M) of four real parameters rather than the original problem (21.2) with an in- 
finitely dimensional domain. We shall see that the function is continuous although 
it is defined by eight different formulae on various parts of W. In particular, it 


274 PROBABILISTIC INEQUALITIES 


follows that the infimum in (21.29) is attained at some M = M,¢ for every fixed r. 
Moreover, M;¢ changes continuosly as we let r vary, which yields the continuity of 


L£,(E) = L,(Mre) (21.31) 
in r, and accordingly (21.28) is a solution to the equation 
L,(Mre) = £(€) =1—r. (21.32) 


The proof of Theorem 21.1 consists in determining all M,¢ that satisfy both re- 
quirements of (21.32) for all € such that 0 < r = II(E) < a*. It occurs that the 
problem has seven types of solutions, depending on mutual relations among e; and 
r. These provide seven respective statements in Theorem 21.1. In Theorem 21.2 
we solve auxiliary moment problem (21.26) for all M of the moment space (21.27) 
and arbitrary fixed r < a. In the proof we use arguments of the geometric moment 
theory developed in Kemperman (1968),[229], in a general setup and some results of 
Rychlik (1997),[314], useful in examining three moment problems. The conclusions 
of Theorem 21.2 are applied in Theorem 21.3 for determining all moment points 
such that (21.31) holds for some r and €. Then the proof of Theorem 21.1 consists 
in selecting the points that additionally satisfy (21.32). 
It is widely known that (21.27) has the explicit representation 


W ={M = (m1,m2,m3) : m, >0, m2 > mz, mym3 > m3} (21.33) 


(cf e.g., Karlin and Studden (1966) ,[223]). Theorem 21.2 below states that (21.26) 
has eight different representations in different subregions of (21.33). We define the 
subregions using geometric notions. These are both more concise and illustrative 
than analytic ones that involve complicated inequality relations in m;, 7 = 1,2,3. 
We first introduce some notation. We shall use small letters for points on the line 
and capital ones for those in the three-dimensional space, e.g., M = (m1,m2,ms3) € 
R°. Pieces of the graph of function t + (t,t?,t3), t > 0, will be denoted by 
SV = {T = (t,t?,t3) : s < t < v}. We distinguish some specific points of 
the curve: O, A, B,C,A,,A2, and D are generated by arguments 0, a, b= a-—r, 
c=atr, a, = (a? + 7?2/3)/?, ag = a+r?/(3a) and d = [2a + (a? + 3r?)/?]/3, 
respectively. We shall write conv{-} for the convex hull of a family of points and/or 
sets, using a simpler notation ST = conv{S,T} and ASTV = conv{S,T, V} for line 
segments and triangles, respectively. The plane spanned by S,7,V will be written 
as pl{STV}. We shall also use a notion of membrane spanned by a point and a 
piece of curve 


mem{V, SU} = U TV. 
s<t<u 
Note that 
conv{V,W,SU}= |) ATVW. (21.34) 


s<t<u 
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At last, for a set A we define the family A! of points lying above A by 
Al = {(m1,m2,m3 +t): (m1,m2,m3) € A, t > 0}. 
Observe that (21.33) is alternatively described by 
W = mem{O, Occ}! = conv{ Ooo}. (21.35) 


We can now formulate the solution to our auxiliary moment problem (21.26). In 
some cases, when this is provided by a unique measure, we display the respective 
representations. 


Theorem 21.2. For fixed 0 <r <a and M = (m1,m2,m3) € W, (21.26) can be 
explicitely written as follows. 


(i) If M € Vp = mem{O,OB'} U AOBCT U mem{O, Goo} ", then 
L,(M) = 0. (21.36) 
(ii) If M € AABC, then 
L,(M) = 1— (m2 — 2am, + a?) /r?. (21.37) 
(iti) If M € mem{C, AC}", then 


(c= m1)? 
L,(M) = —-———_-.. 21.38 
(M) c? — 2em, + m2 ( ) 
(iv) If Me mem{ B, BA}", then 
(m1 — b)? 
L,(M) = —-—————.. 21. 
( ) m2 — 2bm4 + b? ( 3) 
(v) If M € conv{O, B, D,C}, then 
m3 + (b+ c)m2 — bem, 
L,(M) = 21.40 
mM) d(d — b)(c — d) ( ) 
This is attained by a unique measure supported on 0,b,d, and c. 
(vi) If M € conv{O,C, DC}, then 
= = 3 
TTR GD (21.41) 
t(c — t) (cmz — m3)(c?m4 — 2cm2 + ms) 
This is attained by a unique measure supported on 0,c, and 
pee ea a, (21.42) 
cm, — m2 
(vii) If Me conv{O, B, BD}, then 
my — bmy4 (m2 — bm)? 
L,(M) = ————_ = J 21.43 
( ) t(t = b) (ms = bm2)(ms ome 2bma + b?my1) ( ) 
This is attained by a unique measure supported on 0,b, and 
—0b 
j= 23 e bd. (21.44) 


mg. — bm 
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(viii) If M € conv{B,C, AD}, then 
—m2 + (b+ c)m, — bc 
(t — b)(c— t) 
_ (—m2 + (b+ c)m, — bc)3 
~ [-m3 + (2b + c)m2 — b(b + 2c)m, + b?¢] 
1 


L,(M) = 


Ts (hl DA pen Obie-Alrny Hope 21.4 
x [m3 — (b+ 2c)m2 + c(2b+ c)m, — bc?) ( 5) 


This is attained by a unique measure supported on b,c, and 


m3 + (b+ c)mg — bem, 
ee dj. 21.4 
: M2 +4 (b t c)my, bc : la, ( ) 


One can see that for every 0 < r < a* there exists M such that L,(M) = 1—r (cf 
(21.32)) in each region of the partition of the moment space presented in Theorem 
21.2, except of the first one. E.g., the equation holds true for M € AABC' iff 
Me BACA' with B4 = rB+(1—r)A and CA = rC+(1-r)A. In conv{B, C, AD}, 
this is fulfilled by M € U,<;<q BEC. 

In Theorem 21.3 we describe all moment pints M,¢ at which the minimal positive 
value of function (21.26) over boxes B(E) is attained for arbitrary 0 < r < a* and 
e; > 0,1=1,2,3. Let Vi = (a— 1,07 + €2,02 + €3), Vo = (a@— 1,07 — €2, 0° + €3), 
V3 = (a+ &1,a7 — €2,a° + €3) denote three of vertices of the top rectangular side 
R = {(mi, m2, a° + €3): |m; —a’| < €;, 1= 1,2} of B(E). 


Theorem 21.3. For all 0 < r < a* and each of the following cases: 


Mre = Vi € AABCT,” (21.47) 

{Myre} = ViV2  ABA\C, (21.48) 

{Myre} = ViV2Nmem{B, AAj}, (21.49) 

{Mre} = VaV3 DABA2C, (21.50) 

{Mre} = VaV3n mem{B, AAd}, (21.51) 

Mre = V; € conv{ B,C, AA}, (21.52) 

Mre = Vo € conv{ B,C, AAd}, (21.53) 

there exist €; > 0, 71 =1,2,3 such that 

LE) = Ly(Mre) > 0. (21.54) 


The proof of Theorem 21.1 relies on the results of Theorem 21.3. We consider 
various forms (21.47)- (21.53)of moment points solving (21.31) and indicate ones 
that satisfy (21.32) for some r < a*. Solutions to (21.32) with respect to r derive 
values of Prokhorov radius for various combinations of €;, 1 = 1, 2,3. 


Chapter 22 


On Prokhorov Convergence of Probability 
Measures to the Unit under Three 
Moments 


In this chapter for an arbitrary positive number, we consider the family of prob- 
ability measures supported on the positive halfaxis with the first three moments 
belonging to small neighborhoods of respective powers of the number. We present 
exact relations between the rate of uniform convergence of the moments and that 
of the Prokhorov radius of the family to the respective Dirac measure, dependent 
on the shape of the moment neighborhoods.This treatment follows [86]. 


22.1 Main Result 


We start with by mentioning the notion of the Prokhorov distance 7 of two Borel 
probability measures jy, v defined on a Polish space with a metric p: 


7™(H,¥) = 


inf {r >0: p(A) <v(A") +7, v(A) < w(A") +7 for all closed sets A}, (22.1) 


where A” = {x : p(x,A) < e}. The Prokhorov distance generates the weak topol- 
ogy of probability measures. In particular, if we consider probability measures on 
the real axis with the standard Euclidean metric and v = 6, is a Dirac measure 
concentrated at point a, then (22.1) takes on the form 


T(t, 0a) =inf {r >0: u(l-) >1—r}, 


with I, = [a—r,a+r]. For fixed positive a and €;, i = 1,2,3, we consider the 
family M(E), E = (€1, €2, €3), of all probability measures supported on [0,-+oo) with 
finite first, second, and third moments m; = f t’ (dt), i = 1,2,3, respectively, such 
that |m; —a'| < €;, i = 1,2,3. The probability distributions concentrated on the 
positive halfline are called the life distributions. The first standard moments pro- 
vide a meaningful parametric description of statistical properties of distributions: 
location, dispersion, and skewness, respectively. Note that functions t*, i = 1,2,3, 
form a Chebyshev system on [0,-++co). The Chebyshev systems are of considerable 
importance in the approximation theory, because they allow one to estimate inte- 
grals of arbitrary continuous functions by means of respective expectations of the 
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elements of the system (see [223]). The objective of this chapter is to give the exact 
evaluation of the Prokhorov radius 

IE) = sup m(p, da) (22.2) 

weM(E) 

of the parametric neighborhood M(E) of the degenerate measure J,. We are inspired 
by the problem of describing the rate of uniform weak convergence of measures with 
respect to that of its several moments. We therefore, suppose that all €; are small 
in comparison with a. However, it is worth pointing out that the results are not 
asymptotic and we derive precise value of (22.2) for fixed nonzero €;. Presentation 
of some asymptotic approximations will follow Theorem 22.1 containing the main 
result. Note that «; — 0, 7 = 1,2, implies that the respective measure converges 
weakly to dq. Amnastassiou [19] calculated the Prokhorov radius of the class of 
measures satisfying two moment bounds |m; — a*| < ¢;, i = 1,2. The Levy and 
Kantorovich radii of the class were determined in [18,83], respectively. In the latter, 
the Prokhorov radius problem for the Chebyshev system {sin t, cost} on [0,27] was 
also solved. Anastassiou and Rychlik [88] analyzed the Prokhorov radius of classes of 
measures surrounding zero satisfying first, second, and fourth moments conditions. 
Below we give a refinement of the result by Anastassiou [19]. 


Theorem 22.1. [f0 < II(E) < a* = min{a, 1}, then it can be determined by means 
of one of the following algorithms. 


(i) If 
2ae, +e2 <1, (22.3) 
€3 > 3a%e, + 3aeg — (2ae, + ee €1, (22.4) 
then 
1/3 
II(E) = (2a€, + €2) : (22.5) 


(it) Assume that there is r = r(a,€1,€3) € (0,a*) such that 


2\ 3/2 
(3a? +r) ey + €3 = ar?(1 + 2r) — 2(1— 1) (« + 7) = °| . (22.6) 


If 


Spe, (227) 


2\ 1/2 2 
(e+5) 4 = (1+2r)> 


€2 > |2ae, + 2a(1 — r) ; (22.8) 


then II(E) = r(a, é1, €3). 
(wit) Assume that there is r = r(a,€1,€3) € (0,a*) such that 


(a-—ar+r?— ay =(1-r)? [a -r(a—r)? +63]. (22.9) 
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If 
jo 
r?—(1—r) (e+5) 4 <e <r’, (22.10) 
—ratr?— 
cy > CTPAAT AD 5 og nya], (22.11) 
(1—r) 
then II(E) = r(a, €1, €3). 
(iv) Assume that there is r = r(a, €2,€3) € (0,a*) such that 
4 6 

(3a? +r?) €2 + 2ae3 = 3a7r? — (14 ar) - (1 ") Ora)" (22.12) 

Tf 

| | re | ie 2 
€2+ (24 i) + (1—r) (00) < 2ar*, (22.13) 
4 
Qaey > eg +r? + (1-71) (002)’ (22.14) 
then II(E) = r(a, €2, €3). 
(v) Assume that there is r = r(a, €2,€3) € (0,a*) such that 

[a—r(a—r)? - Z| =(1—r) [a® —r(a—r)* 4 Ee (22.15) 

Tf 

a rt 

Zar” — (2+ Die ~ Oa) < €2 < 2ar*—r’, (22.16) 
€ >a-—ratr?—(1—r)? [a* —r(a—r)? —e9] ie : (22.17) 


then II(E) = r(a, €2, €3). 
(vi) Assume that there is r = r(a, €1, €2,€3) € (0,a*) such that 


(1 - r)i/2 [(3a? r?) €1 + 3a€9 €3| = (r? — 2ae, — €2) (2ae + eg — yi? : 
(22.18) 
If 

Qae, + €2 <1’, (22.19) 

pe 1/2 ne 1/2 

@—P 420(a+5) €y+ a+ (+>) €2 
2 1/2 

- (« + 7) - ‘ r<eg< (3a? — r?) €, + 3ae2, (22.20) 


then II(E) = r(a, €1, €2, €3). 
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(vii) Assume that there is r = r(a, €1, €2,€3) € (0,a*) such that 


(1 - r)/? [(3a7 r?) €1 — 3a€q €3| = (r? —2ae, + €2) (2ae —€,— yi? F 


(22.21) 
If 


2\ 1/2 
<e< evan (ot 2) 


2 1/2 
(« + 5) — ‘ r?, (22.23) 


then II(E) = r(a, €1, €2, €3). 


We have only considered the Prokhorov radii that satisfy I(€) < min{a,1} 
which is of interest in examining the rate of convergence. The conditions for II(€) = 
1 < acan be concluded from the proof of Theorem 22.1. The case a < II(E) < 1 can 
be analyzed by means of tools used in our proof. The solution has a simpler form 
than that of Theorem 22.1, but we do not include it here. Although the formulas for 
determining the Prokhorov radius look complicated at the first glance, they provide 
simple numerical algorithms. Case (i) has an explicit solution. In the remaining 
ones, one should determine solutions of polynomial equations (22.6), (22.9), (22.12), 
(22.15), (22.18), and (22.21) in r of degree from 6 in Case (iii) to 9 in Case (v) with 
fixed a and ¢€;, i = 1,2,3, and check if they satisfy respective inequality constraints. 
If the Prokhorov radius is determined by one of the polynomial equations, there are 
no other solutions to the equations in (0, a*). 

In comparison with analogous two moment problems, Theorem 22.1 reveals 
abundant representations of Prokhorov radius, depending on relations among «,, 
i= 1,2,3. Formula (22.5) coincides with the Prokhorov radius of the neighborhood 
of 5q described by conditions on the first and second moments only (cf. [19]). It 
follows that if the conditions on the third one are not very restrictive (see (22.4)), 
then they are fulfilled by the measures that determine the Prokhorov radius in the 
two moment case. One can also confirm that equations (22.6), (22.9) and (22.12), 
(22.15) determine the Prokhorov radii for classes of measures with other choices of 
two moment conditions. The former two refer to the first and third moments case 
and the latter to that of the second and third moments, respectively. Again, (22.8), 
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(22.11) and (22.14), (22.17) indicate that admitting sufficiently large deviations of 
the second and first moments, respectively, we obtain the Prokhorov radii depending 
only on the remaining pairs. One can deduce from our proof that Cases (iii) and (v) 
refer to smaller values of €3/€, than Cases (ii) and (iv), respectively. Using (22.10) 
and (22.16) we can find some lower estimates II(E) > ey/* and II(€) > (e2/2a)*/? 
for the Prokhorov radius determined by equations (22.9) and (22.16), respectively. 
In the last two cases, the radius depends directly on all €;,2 = 1,2,3, and the respec- 
tive inequality constraints represent upper bounds for the range of each €;. We can 
get some explicit evaluations of II(€) there making use of the fact that all factors in 
(22.18) and (22.21) are positive. For example, from (22.18) and (22.21) we conclude 
that 


(Goe Fes) SNE) Se Coaten’, (22.24) 


Cg =e)" Se) Sede ey" (22.25) 


(cf. (22.5)). 

In Cases (ii) and (iv) there are no simple estimates of II(€) in terms of €, and €2 
that could be compared with (22.5), (22.24) and (22.25), because either €2 or €, can 
be arbitrarily large then. Nevertheless, assuming that the parameters €; are small 
and so is II(€), we can solve the approximates of (22.6) and (22.12) neglecting all 
powers of r except of the lowest ones. This yields 


3a7e1 + €3 


II x1 
(é) aS 


3a7€q + 2ae3 ue 
Il(€) & | —>——— 
@)~ (Sos) 
respectively. Analogous approximations of (22.9), (22.15), (22.18), and (22.21) give 
Ie) « 2 tesla a) 
6a2€, — 3aef + 2e3’ 


m(€) d: (a? + €3)" _ (a? _ €)° 
~ 6ateEs 3076 + dates — 3’ 


(2ae, + es)° 


11S Se, 
©) (3a7e1 + 3a€2 — €3)° 


me) @1— —2aerree)” 
(3a7e1 — 3a€2 — eae 


respectively. 
A general idea of proof of Theorem 22.1 comes from [19]. We first find 


L,.(M) int { u(t) [Oe mcan =m,i= ia}, (22.26) 
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for fixed 0 <r <a and all 


M = (m1,m2,m3) € W = {a : | t' (dt) =m, i= 12,3}. (22.27) 
0 
Then 
IN(€) =inf{r >0:2L,(€) >1-r}, (22.28) 
where 
L,(E) = inf {L,(M) : M € B(E)NW}, (22.29) 
B(E) = {M = (mi, ma,ms3) : |mi — a’| < e, 1 = 1,2,3}. (22.30) 


Further rectangular parallelepipeds (22.30) will be called shortly boxes. The ad- 
vantage of the method is that having solved (22.26), we can further analyze func- 
tion L,(M) of four real parameters rather than the original problem (22.2) with 
an infinitely-dimensional domain. We shall see that the function is continuous al- 
though it is defined by eight different formulae on various parts of W. In particular, 
it follows that the infimum in (22.29) is attained at some M = M,¢ for every fixed r. 
Moreover, M,¢ changes continuously as we let r vary, which yields the continuity 
of 


L,(E) = Ly (Mre) (22.31) 
in r, and accordingly (22.28) is a solution to the equation 

L, (Mye) = £L,(E) =1 =r. (22.32) 
The proof of Theorem 22.1 consists in determining all M;,¢ that satisfy both re- 
quirements of (22.32) for all € such that 0 < r = II(E) < a*. It occurs that the 
problem has seven types of solutions, depending on mutual relations among e; and 

r. These provide seven respective statements in Theorem 22.1. 
In Section 22.2, we solve auxiliary moment problem (22.26) for all M of the 
moment space (22.27) and arbitrary fixed r < a. In Section 22.3, we analyze 
variability of the solution over different subregions of the moment space. The results 


are applied in Section 22.4 for determining all moment points such that (22.31) holds 
for some r and €. In Section 22.5, we choose those that additionally satisfy (22.32). 


22.2 An Auxiliary Moment Problem 


It is widely known that (22.27) has the explicit representation 

W= {M = (m1,M2,m3) :m1 > 0, m2 > mi, mms > m3} (22.33) 
(cf., e.g., [314]). Theorem 22.2 below states that (22.26) has eight different represen- 
tations in different subregions of (22.33). We define the subregions using geometric 


notions. These are both more concise and illustrative than analytic ones that in- 
volve complicated inequality relations in m,;, 1 = 1,2,3. We first introduce some 
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notation. We shall use small letters for points on the line and capital ones for those 
in the three-dimensional space, e.g., M = (m1,™m2,™m3) € R®. Pieces of the graph of 
function t + (t,t?,t3), t > 0, will be denoted by SV = {T = (t,#?, 13): 5 <t<v}. 
We distinguish some specific points of the curve: O, A, B, C, Ai, Ag, and D are gen- 
erated by arguments 0, a, b = a—r, c= a+r, a; = (a? +1?/3)"/?, ag = a+r?/(3a) 
and d = [2a + (a? + 3r?)!/]/3, respectively. We shall write conv{-} for the convex 
hull of a family of points and/or sets, using a simpler notation ST = conv{S,T} 
and ASTV = conv{S,T, V} for line segments and triangles, respectively. The plane 
spanned by S,T,V will be written as pl{STV}. We shall also apply a notion of 
membrane spanned by a point and a piece of curve 


mem {V, SU} = U TV. 


s<t<u 
Notice that 
conw {v, w, 50} = J arvw. (22.34) 
s<t<u 


Finally, for a set A we define the family A’ of points lying above A by 
Al = {(m1,m2,m3 + t) : (m1,m2,m3) € A, t > 0}. 
Observe that (22.33) is alternatively described by 
W = mem{O, Ooo}! = conv {Ooo} . (22.35) 


We can now formulate the solution to our auxiliary moment problem (22.26). In 
some cases, when this is provided by a unique measure, we display the respective 
representations, because they will be further used in our study. 


Theorem 22.2 For fired 0 <r <a and M = (m1,m2,ms) € W, (22.26) can be 
explicitly written as follows. 
(i) If M EV, =mem{O, OB}! U AOBC! Umem{O, Coo}!, then 


L,(M) =0. (22.36) 
(ii) If M € AABC', then 


(m2 — 2am, + a) 


L,(M) =1— : (22.37) 
r 

(iti) If M € mem{C, AC}", then 

L,(M) = _(e=m)* (22.38) 
; ~ C2 —2em, + me’ : 
(iv) If M € mem{B, BA}, then 
2 

Lays») (22.39) 


my — 2bm, + b2° 
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(v) If M € conv{O, B, D,C}, then 


—ms3 + (b+ c)m2 — bem 


L,(M) = 22.40 
mM) d(d — b)(c — d) ( ) 
This is attained by a unique measure supported on 0, b, d, and c. 
(vi) If M € conv{O,C, DC}, then 
3 
cm, — M2 (em 7 m2) 
L,(M) = —— = tT oo _.. 22.41 
mt t(c — t) (emz — mg) (c2my — 2cem2 + m3) ( ) 
This is attained by a unique measure supported on 0,c, and 
fi ES ig oh (22.42) 
cm, — m2 
(vii) If Me conv{O, B, BD}, then 
my — bm, (m2 — bm)° 
L,(M) = ———— = a. 22.43 
( ) t(t a b) (ms 7 bm2) (ms = 2bm2 + b?m1) ( ) 
This is attained by a unique measure supported on 0,b, and 
—b 
po 28’ ei. (22.44) 
ms — bm4 
(viii) If Me conv{ B,C, AD}, then 
Linh mz + (b+c)m, — be 
(t — b)(c—t) 
(—m2 + (b+ c) mi — bc)? 
[—ms3 + (2b + c)me2 — b(b+ 2c)m + bc 
x A (22.45) 
[m3 — (b+ 2c)m2 + c(2b+ c)m — be?] | ; 
This is attained by a unique measure supported on b, c, and 
+ (b4 b 
fas SSE Ca OO 24 (22.46) 


—m2 + (b+ c)m, — bc 


In order to simplify further analysis of (22.37)—(22.46), we represented them so 
that all factors of products and quotients are nonnegative there. Also, we defined 
the subregions of the moment space as its closed subsets. One can confirm that the 
formulae for L,(M) for neighboring subregions coincide on their common borders. 
Because each formula is continuous in M in the respective subregion, so is L,(M) 
in the whole domain. Observe also that letting r vary, we change continuously 
the shapes of subregions. Combining this fact with the continuity of each above 
formulae with respect to r, we conclude that L,(/) is continuous in r as well. 

The proof is based on the geometric moment theory developed in [229] in a 
general setup. The theory provides tools for determining extreme integrals of a 
criterion function over the class of probability measures such that the respective 
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integrals of some other functions satisfy some equality constraints. Kemperman 
[229] used the fact that each measure with given expectations of some functions has 
a discrete counterpart with the same expectations such that its support contains 
at most one more point than the number of conditions, and replaced the original 
problem by analyzing the images of integrals which coincide with convex hulls of the 
respective integrands. E.g., in view of the theory the latter representation in (22.35) 
is evident. For the special case of indicator criterion function that we examine here, 
Kemperman [229] described an optimal ratio method for solving respective moment 
problems. A version of the method directly applicable in the problem is presented 
in Lemma 22.3. It is obvious that L,(M) = 0 iff M € V, which denotes here the 
closure of the family of moment points convf{ OB, Coo} for the measures supported 
on the complement of J. The optimal ratio method allows to determine L,(M) for 
other elements of the moment space (22.27). 


Lemma 22.3. Let H be a hyperplane supporting the closure of the moment space 
and let H' be the closest parallel hyperplane to H supporting V,. Then for all 
M € conv{WNH, V>NH’S, yields 


_ p(H',M) 


Ee(M) = ag Hy” 


(22.47) 


where the numerator and denominator stand for the distances of hyperplane H' from 
moment point M and hyperplane H, respectively. 


Since explicit determining the supporting hyperplanes poses serious problems 
especially in multidimensional spaces, we also apply some other tools useful for 
studying three moment problems. Lemmas 22.4 and 22.5 give adaptations of aux- 
iliary results of Rychlik [314] to our problem. 

Lemma 22.4. Let 0 < ti < tg < t3 < ta. Then T; = (ti, t?, t?) lies below (above) 


Lea 


the plane spanned by T;, 7 #1, iffi=1 or 3 (2 or 4, respectively). 


Lemma 22.5. A measure attaining L,(M) for a moment point M of an open 
set W! C W, W'N Ooo = 9D, is a mixture of measures attaining L,(M;) for some 
moment points M; of the border of W'. Accordingly, the subset of support points of 
the measures attaining L,(M) for border moment points contains that for the inner 
ones. 


In the rest, we shall repeatedly use analytic representations of moment points 
as combinations of image points T; € Ooo. A four-point representation M = 
a a;T;; has the coefficients 


m3—mM2> >t; +m YO tyth — [] t; 
j#t J#I#Ak J#t 
IT @: — t;) 


Ft 


a; = 1<i,j,k<4. (22.48) 
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3 
Also, M = SO aiT; € p{T), To, T3} with coefficients 
i=l 


J#t j#t am Fe 
a; = ; 1<i,j <8, (22.49) 
I] (@: — ty) 
j#i 
iff it satisfies the equation 
m3 — M2 (ty + to 4 t3) My (tite + tyt3 + tots) — tytotz = 0. (22.50) 


If the left-hand side of (22.50) is less or greater than 0, then M lies below or above 
the plane. Generally, formulae (22.48) and (22.49) represent coefficients of linear 
combinations. If all a; > 0, then the combinations are convex. 


Proof of Theorem 22.2. We first determine V, = conv{OB, Coo}, where L, 
vanishes. We claim that 
VY, = mem{O, OB}! U AOBC' Umem{O, Coo}! 


One can easily check that all M € V, satisfy m; > 0 with m2 > max{mj, (b+c)m1— 
bc}, and we determine the range of the third coordinate. We start with showing 
that this is unbounded from above. If either m; < b or m; > c¢ and mz > m7, then 
(m1, m2) has a convex representation 


m, =at+(l-a)s, (22.51) 


ma = at? + (1—a)s?, (22.52) 
for t > co and s — m, from the left. Substituting s = (m1 — at)/(1 — a) from 
(22.51) into (22.52), we obtain 

at? = mz — m? +2(tm, —m2)a > m2—m? >0 
and 
m3 > at® > (m2 — mj) t > x, as t > oo. 

If b< m1 < cand m2 > (b+ c)m, — be, then (mj, mz) is a convex combination 
of (b, 6”), (c,c?), and (t,t?) for sufficiently large t. By (22.49), the coefficient of t is 
mg — (b+ c)m, + be 

(t — b)(t — c) 
and so m3 > at® — oo, as t > oo. Since V, is a convex body, we need to show that 
mem{O,OB}, AOBC and mem{O, Coo} form its bottom. The surfaces consist of 
moment points generated by 0,t € (0, b], and 0, b, c, and 0,t € [c, 00), respectively, 
which belong to [0,b] U [c,co). Moreover, the membranes are parts of the lower 
envelope of W. Consequently, it suffices to prove that no moment point of a measure 


supported on [0, ] U[c, co) lies beneath AOBC. We can confine ourselves on three- 
point distributions. Due to Lemma 22.5, if0 <t; < bor t; >c,71=1,2,3, then the 


— 
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respective T; lie above AOBC and so do all M € AT,ToT3. This completes the 
proof of Part (i). 

The next three parts are proved by means of the optimal ratio method. Consider 
the vertical hyperplane Hg : mz — 2am, + a? = 0 supporting W along halfline A’. 
Then H!, : mg — (b+ c)m1 + bc = 0 obeys the requirements of Lemma 22.3, and 
HL OAV, = BC. Therefore, for M € conv{ At, BO} = AABC", we have 
p(H!,M) — |m2— (b+ c¢)m, + be| my — 2am, +a? 
ey ee ec eR 
p (Hi, A) ag re 
as stated in (22.37). For proving (22.38), we consider Hy : mz — 2tm, + t? = 0 for 
some t € (a,c). Then HiNW = T"', and Hi, : m2 —2tm + 2te—c? = 0 is the closest 
parallel plane to H; that supports V, along C'. Then for M € conv{T!, C1} = FO. 
we obtain 


L,(M) = 


P(Hi,M) _ |m2 — 2tm, + 2te — c?| 
(Hy, T) (c—t)? 


Condition M € TC! implies mz = (t+ c)m, —tc. This allows us to replace t with 
(em, — m2)/(e — m1) in (22.53) and leads to (22.38). Note that the formula holds 
true for all M € hescro! = mem{C, AC}. In the same manner we can prove 
part (iv) by studying H, for t € (b, a) with H/, : mz —2tm, + 2tb—b? = 0 that touch 
Y, along Bt. 

We are left with the task of determining L,(M) for conv{O, B,C, BC} that 
is bounded below by mem{O, BC}, and above by AOBC, AABC, mem{C, AC} 
and mem{B, BA}. Observe that the optimal convex representations for M from the 
border surfaces are attained for the support points 0 with ¢ € (b,c), and 0, b, c, and 
b, a, c, and c with t € (a,c), and b with t € (b, a), respectively. Due to Lemma 22.5, 
examining L,(M/) for the inner points, we concentrate on the measures supported 
on 0, b, c and some t; € (b,c). 

We claim that the optimal support contains a single t € (b,c). To show this, 
suppose that M € PQ for some P € AOBC and Q € conv{T; : b < t; < c}. All 
T; lie below pl{OBC} (see Lemma 22.4), and so do @ and M. The ray passing 
from P through M towards Q runs down through conv{BC} 3 Q. We decrease 
the contribution of Q which is of interest once we take the most distant point_ of 
conv{ BC}. This is an element of the bottom envelope mem{B, BC} of conv{ BC} 
(cf. [314]). Accordingly, an arbitrary combination of T; € BC can be replaced by a 
pair Band Te BC, which is the desired conclusion. 

For given M € conv{O, B,C, BC}, we now aim at minimizing 


a(t) = 


L,(M) = (22.53) 


—ms3 + (b+ c)m2 — bem 
t(t — b)(c — t) 
(cf. (22.48)) which is the coefficient of T € BC in the linear representation of M 


in terms of O, B, C, and T. If b < t < c, both the numerator and denominator 
of (22.54) are positive, the former being constant. By differentiating, we conclude 


(22.54) 
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that the denominator has two local extremes: a negative minimum at d’ < b and 
a positive maximum at d = [2a + (a? + 3r?)*/?]/3 © (a,c), which is the desired 
solution. We complete the proof of part (v) by noticing that L,(M) = a(d) iff the 
representation is actually convex, i.e., when M € conv{O, B,C, D}. 

Moment points of conv{O,C, DC} with borders mem{O, DC}, AODC and 
mem{C, DC} are treated in what follows. The solutions of the moment problem 
on the respective parts of the border are provided by combinations of 0 with some 
t € [d,c|, and 0, d, c, and c with some t € [d,c], respectively. Referring again 
to Lemma 22.5, we study combinations of 0, d, c and some t € (d,c) when we 
evaluate (22.26) for the inner points. We showed above that combinations with 
a single t reduce p(I,-). Analyzing (22.54) for t > d we see that decreasing t 
provides a further reduction of the measure. However, we can proceed so as long 
as M € conv{O,D,T,C} and stop when M € AOTC. By (22.50), the condition 
is equivalent to m3 — (t+ c)m2 + tem, = 0, which allows us to determine (22.42). 
Combining (22.42) with (22.49), we conclude (22.44) for M € Uje,<, AOTC = 
conv{O, C, DC} (cf. (22.34)). 

What remains to treat is the region situated above mem{O, BD}, ABDC, and 
below mem{C, AD}, AABC, mem{B, BA}, and AOBD. Analysis similar to that 
in the proof of the previous two cases leads to minimization of (22.54) with respect to 
t € (b,d). Fora fixed M, we can reduce (22.54) by increasing t until M belongs to the 
border of the tetrahedron conv{O, B,T,C}. There are two possibilities here: either 
M € conv{O, B, BD}, and then M € AOBT ultimately, or M € conv{B,C, AD}, 
and then M becomes a point of ABCT. Analytic representations of M € pl{OBT} 
and M € pl{ BCT} (cf. (22.50)) makes it possible to write (22.44) and (22.46), 
respectively. Final formulae (22.43) and (22.45) are obtained by plugging respective 
support points into (22.49). Verification of details is left to the reader. 


One observes that for every 0 < r < a* there exists M such that L,(M7) =1-—r 
(cf. (22.32)) in each region of the partition of the moment space presented in 
Theorem 22.2, except of the first one. E.g., the equation holds true for M ¢€ 
AABC' iff M ¢ BACA' with BA =rB4(1—r)A and C4 =rC4(1—n)A. In 
conv{ B, C, AD}, this is satisfied by M € Un<icg BECT. A more difficult problem 
that will be studied in next two sections is determining M that minimize L, over 
boxes (22.30). The moment points that obey both the conditions are needed for 
the final determination of the Prokhorov radius. 


22.3 Further Auxiliary Results 


Lemma 22.6. L,(M) is nonincreasing in m3 as m1,mMg are fixed and M € W. 


Proof. We prove the property for every element of the partition of the moment 
space presented in Theorem 22.2 and refer to continuity of the function for justifying 
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the claim. The result is trivial for moment points in the first five subregions. 

Consider M € conv{O,C, DC}. Note that (22.42) is decreasing in m3, and 
examine the former representation in (22.41). This is the ratio of two positive values: 
the numerator is constant, and the denominator decreases for t € [c/2,c¢] D [d, c]. 
Hence, by increasing m3 we decrease t, and increase the denominator, and decrease 
the ratio, as required. 

In conv{O, B, BD}, increase of m3 implies that of t (see (22.44)). Ast > 6 
increases, so does the denominator of the first representation in (22.43). Since 
the numerator is a positive constant, the result follows. Likewise, from (22.14) we 
deduce that t(m3) is decreasing in conv{ B,C, AD}, and (22.13) is increasing in t 
as t € [a,c] D [a, d]. 


Lemma 22.7. L,.(M) is nondecreasing in m1 as m2, m3 are fixed, and nonincreas- 
ing in mz asm, ms are fixed, when M € AABC!Umem{C, AC}!Umem{B, BA}". 


Proof. Glancing at (22.37)—-(22.39), we immediately verify the latter statement. 
The former is also evident for M ¢ AABC". Differentiating (22.6), we get 
OL,(M) | 2(e— my) (em — m2) 5 
Om, (c? — 2cem4 + m2)” = 
in mem{C, AC}, because both factors of the numerator are nonnegative there. In 
the same manner we conclude the statement for M € mem{B, BA}". 


Lemma 22.8. L,.(M) is nonincreasing in m1 as m2, m3 are fixed, and nondecreas- 
ing in M2 as m1, M3 are fixed, when 


M € conv{O, B, D,C} Uconv{O,C, DC} Uconv{O, B, BD}. 


Proof. By (22.40), the assertion is obvious for conv{O, B, D,C}. Assume now that 
M € conv{O, B, BD}, and let first m, vary and keep m ,mg fixed. Increase of my 
implies that of t and t(t — b) (cf. (22.44)). Therefore, the central term of (22.43) is 
the ratio of two nonnegative factors: the former is decreasing in m1, and the latter 
is increasing. Also, 
2 
eee PS Tass, (22.55) 
Ome (ma ie bm) 

Observe that m3 — 6?m, = 0 is the plane supporting conv{O, B, BD} along OB, 
and the rest of the region lies above the plane. Therefore, (22.55) is nonnegative, 
and both ¢ and t(¢ — b) are nonincreasing in mz. Using (22.43) again, we confirm 
the opposite for L,(M). 

The last case requires a more subtle analysis. Expressing (22.41) and (22.42) in 
terms of 3 = 8(M) = cmg — m3 > 0 and y = 7(M) = cm, — m2 > 0, we obtain 
L,(M) = y3/[B(cy — B)] and t = B/+, respectively. Differentiating the former 
formally, we find 


9? (2cBy7' + 288'y — 3674! — cB'y") 


aoe Fey — BP 


(22.56) 
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Since 22 = 0 and & 
omy fa) 


My _ C, 
OL,(M)  ey7(2cy—38) — 3cy3(2c/3 — 2) 
er RS) (22.57) 
omy B(cy — 8) (cy — 8) 
The sign of (22.57) is the same as that of (2/3)c — t. We shall see that (2/3)c < t 
for all t € (d,c). It suffices to check that (2/3)c¢ < d = (1/3)[2a+ (a? + 3r?)!/?], 


and a simple algebra shows that this is equivalent to r < a. Therefore, (22.57) is 


negative, and L,. actually decreases in m,. Combining (22.56) with a 5 = cand 
aa = —1, we have 


aL,(M) 7 (36? 1. cy?) 2 3y4 ie = c?/3) 


ee ei, 
Om Boley = 8)? B(cy — B)? 
because t > (2/3)¢ > c/V/3, as we checked above. This ends the proof. 


Analyzing variability of L,(/) in conv{ B,C, AD}, we split the region into three 
parts. 


Lemma 22.9. Set a < a; = (a? +1?/3)/? < ag =a+r?/(3a) < d. 

(i) L,(M) is nondecreasing in m, as m2, m3 are fixed, and nonincreasing in m2 
as M1, M3 are fixed, when M € conv{ B,C, AAj}. 

(ti) L,(M) is nondecreasing in m, as m2, m3 are fixed, and in mz as m1, M3 
are fixed, when M € conv{B,C, Ay Ao}: 

(iit) L-(M) is nonincreasing in my as m2, m3 are fixed, and nondecreasing 
in mg as m4, m3 are fixed, when M € conv{B,C, AyD}. 


Proof. Rewrite (22.45) and (22.46) as t = B/y € [a,d] and L,(M) = 73/[(8 - 
by)(ey — 8)] for 


B= B(M) = —m3 + (b+ c)m2 — bem > 0, 
y= 7(M) = —m2 + (b+ c)m, — bc > 0. 


Therefore 
FOE ee i ce at ee i il ct 
, (8— ey — BP 
Because we = —bc and a =b+¢, 
OL,(M) _ 89 [2 (b? + bc +c?) y— 3(b+ c) 8] 
Om, (8 — by)?(ey — 6)? 
3 (a, —t 
- 3ERaG- aa ee 
Similarly, due to we =b+cand ae = —1, we derive 
aL,(M) _ 7 3R- (P+ ee+2)77] _ ay (Pa) es 
dmg (8 — by)?(ey — 6)? (Ody) (ey 8) , 


Accordingly, if M € ABCT for some t € [a,d], then (22.58) and (22.59) hold. 
Analyzing the signs of the formulae, we arrive to the final conclusion. 
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22.4 Minimizing Solutions in Boxes 


In Theorem 22.3, we describe all moment pints M,¢ at which the minimal positive 
value of function (22.26) over boxes B(€) is attained for arbitrary 0 < r < a* and 
e, > 0, 1 = 1,2,3. In contrast with Lemmas 22.6—22.9 proved by means analytic 
techniques, we here use geometric arguments mainly. Let V; = (a — «1, a? + ©, 
a? + €3), Ve = (a— 1, a? — €2, a? + €3), V3 = (at €1, a? — €2, a? + €3) denote three 
of vertices of the top rectangular side R(E) = {(m1, m2, a? + €3) : |my —a’| < «i, 
i = 1,2} of B(E). 


Theorem 22.10. For all 0 < r < a* and each of the following cases: 


Mre = V,€ AABC', (22.60) 
{Mres = ViVeaNABA,C, (22.61) 
{Me} = ViV2Nmem{B, AAy}, (22.62) 
{Mre} = VoV3N ABADC, (22.63) 
{Me} = VaV3Nmem{B, AAd}, (22.64) 
Myre = Vi € conv {B, C, AA, } (22.65) 
Mye = V2 € conv {B, C, AA; } (22.66) 


there exist €; > 0,7 =1,2,3 such that 
L£,(E) = Ly (Mre) > 0. (22.67) 


Proof. By Lemma 22.6, £,(E€) is attained on the upper rectangular side R(€) 
of B(E). We consider all possible horizontal sections W = W(e3) = {M € W: m3 = 
a? + €3}, €3 > 0, of the moment space and use Lemmas 22.7 - 22.9 for describing 
points minimizing L,(M) over all M € R(E) NW. We shall follow the convention 
of denoting by A the horizontal section of a set A at the level m3 = a? + €3. Also, 
we set A = (a,a’,a? + €3), and define O, B, C analogously. By (22.33), every 
section W is a convex surface bounded below and above by parabolas mz = m7 
and mz = (m3gm;)/?, 0 < m, < ma, respectively. The last belongs to a vertical 
side surface of W, and the latter is a part of its bottom. On the right, they meet 
at P= (ma/?,m3/>, ms) € Oov. 

We start with the simplest problem of analyzing high level sections, and then 
decrease €3 to include more complicated cases. We provide detailed descriptions of 
the sections for three ranges of levels m3 > c?, d? < m3 < c?, and a? < m3 < d?® that 
contain different number of subregions defined in Theorem 22.2. For each range, 
we further study shape variations of some subregion slices caused by decreasing the 
level in order to conclude various statements of Theorem 22.10. We shall see that 
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the number of cases increases as €3 — 0, and these which occur on higher levels 
are still valid for the lower ones. Since arguments justifying the particular cases for 
different levels are in principle similar, we present them in detail once respective 
solution appear for the first time and omit later. 

Suppose first that m3 = a® + €3 > c3. Only point of first four elements of 
partition of Theorem 22.2 attain the level. Moreover, no bottom points of AABC', 
mem{C, ACK. and mem{B, BA} belong to W. It follows that the shapes of the 
sections AA BC, conv{AC}, conv{B A} coincide with the projections of respective 
sets onto (m1,mz2)-plane. By Lemma 22.7, L,. decreases as we move left and up 
in each subregion on the level (here we refer to directions on the horizontal plane, 
but no confusion should arise if we use the same notions for describing movements 
in the space). The minimum over R(E) is so attained at its right upper corner Vj. 
Excluding cases V; € V, and Vi ¢ W that imply £,(€) = 0, we see that (22.60) is 
the only possibility. 

If d? < m3 < c3, all regions except of conv{O, B, BD} should be examined. It 
is only mem{B, BA} whose shape remains unchanged with respect to the previous 
case. The section of AABC', formerly triangular, is now deprived of its upper 
vertex C, being cut off by a line segment QS, say, with Q € BC and S € AC. 
The upper part of the linear border of mem{C, AC} is replaced by a curve SPc 
mem{C, AC}. There is a point U, say, such that {U} = SP DC. Lines SP, and 
QS € AABC, and QU c ABDC are the borders of conv{B, C, AD}. The last one, 
together with XQ and XU for some X € OC form the borders of the respective 
triangular section of the tetrahedron conv{O, B, D,C}. Note that X is a bottom 
point of W and belongs to the upper parabolic bound of W. Also, XU C AODC 
and UP c mem{C, DC} separate the points of conv{O,C, DC} from the rest of 
the section. Finally, V, is located below conv{O, B, D,C} and right to AABC!, 


among 0a, OB, and BQ € BC! and XQ € AOBC. It can be verified that the 
slopes of all curved and straight lines mentioned above are positive. Also, writing 
T,T> and Tt. we adopted the convention that T, has the first two coordinates less 
than TJ». 

A crucial fact for our analysis is the mutual location of A € A! and Q € BC. 
We consider a more general problem replacing Q € BC by M € BT for some t > a. 
Two-point representation of M gives 

m3 + bt(b+t) (b+t)ms3 + b7¢? 
bP +bt+e? ? B+ bth va) 
As m3 varies between b° and t?, then all m, increase, and M slides up from B to T 
along BT. It follows that M is above and right to A for sufficiently large m3 and 
below and left for small ones. For m3 = a? + €3, we can check that m , < a and 


M = M(ms) = ( (22.68) 


M2 < a? are equivalent to 
63 <r(t—a)(a+b+t), (22.69) 
ab + at + bt 


22.70 
b+t ? ( ) 


€3 < r(t — a) 
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respectively. The right-hand side of (22.70) is smaller than that of (22.69). It follows 
that as €3 decreases from t? — a? to 0, M is first located above and right to A, then 
moves to the left and ultimately below A. These changes of mutual location occur 
on higher levels for larger t. 

In particular, we show that Q € BC is left and above A on the section m3 = d?. 
To this end we set ¢; = d? — a® and t = c and check that (22.69) is true and 
(22.70) should be reversed. Indeed, the former can now be written as a? + 3ar? > 
d?, and further transformed into (t — 1)(t? — 20t — 8) < 0 under the change of 
variables t = (1 + 3r?/a?)!/? € (1,2). The cubic inequality holds true for t € 
(—o0, 10 — 6V3) U (1,10 + 6/3) D (1,2). Negation of (22.70) is equivalent to each 
of the following: 


3 
2]2+(1+se)'/7] > 54481¢— 270, 


(26 + 6t)(1 + 3t)1/? > 26 + 45t — 2727, 


27t? (13 + 94t — 27t7) > 0, 


for t = r?/a? € (0,1), and the last one is obviously true. 

It follows that for m3 = a? + €3 varying from c® to d? there are two possi- 
ble locations of Q with respect to A. For es > 3ar?, Q is right and above, and 
(22.67) enforces (22.60) like in the previous case. Otherwise this is only one of three 
possibilities. For the other we should assume that Vj Vo lies right to Q. Let $j, 
i = 1,2, stand for the intersections of SU and A;C, respectively. Each element of 
the sequence A, $, 51, Sg and U lies above and right to the preceding. Consider 
conv{ B,C, AA} located among QS, QS; and 59). If Vy belongs there, then 


R(E) NW Cc AABC' U mem{C, AC}" 


Umem{B, BA}! U conv { B,C, AA; }. (22.71) 


By Lemmas 22.7 and 22.9(i), L,. is increasing in m,; and decreasing in mz in this 
part of the horizontal slice, and (22.65) can be concluded. 

Assume now that V; is right to Q and above QS;. Then VijV2 and QS, C 
ABA\C cross each other, because V2 lies below A. We establish that (22.61) 
holds. If R(E) contains any points of the sets conv{O,C, C, DC}, conv{O, B, D,C}, 
and conv{ B,C, AD}, ie., ones lying above OS and SP, th then by Lemmas 22.8 
and 22.9(iii) they can be replaced by ones of R(E) (OSU SoP ), and the minimum 
of L, will not be affected. Similarly, by Lemmas 22.7 and 22.9(i), all points of R(E) 
located below QS; and SP can be replaced by some of R(E) N (QSq U SiP). We 
could exclude points of SP from consideration once we show that L,. increases along 
SP. Indeed, the points of SP havea parametric representation M = aT+(l—a)C 
with m3 = a® + 3 <c? fixed and t € (a,m}'°), and so all 
3 — ™z3 


Be ae cay sen 


(22.72) 
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3 


Cc —ms3 
2 ee Sees 22.73 
my o+ca+t’ ( ) 
3 
>» (c—ms) (e+?) 
SO ae 22.74 
Mie ce+ct+t2 ? ae) 


are decreasing. Accordingly, it suffices to restrict ourselves to analyzing L,(M) in 
R(E) Nconv{ B,C, AAs} which by Lemma 22.9(ii) is minimized at the lowest left 
point of the surface, i.e., the one defined by (22.61). 

Suppose now that m3 = a? +3 < d°. The most apparent difference from 
the previous case is presence of conv{O, B, BD}. This contains bottom moment 
points including P and is located along the upper border of W. Precisely, this 
is separated from conv{O, B, D,C} and conv{B, C, AD} by a line segment YZ, 
say, with Y € OD and Z € BD, and a curve ZP. respectively. Writing P, we 
refer to the notion introduced above, although now the point is defined for a dif- 
ferent level m3. It will cause no confusion if we use it and some other intersection 
points of horizontal sections with given segments in the new context. Furthermore, 
appearance of conv{B,C, AD} changes radically. Now its border has four parts: 


two linear QS and QZ and two curved ones SP and ZP. These are borders with 
A ABC", conv{O, B, D,C}, mem{C, AC}", and conv{O, B, BD}, respectively. On 
this level, the region is spread until the upper right vertex P of W. We can also ob- 
serve that the slice of conv{O, B, D, C} has four linear edges with vertices Q, X, Y, 
and Z. On the left to the tetrahedron,behind XY, we can find conv{O,C, DC}, 
which is the convex hull conv{ XY} of the piece of parabola mz = (m3m,)'/? be- 
tween X and Y. In contrast with the last case, the region is separated from P 
here. The remaining regions look in much the same way as above. There are 
Vr = conv{Q, OX, O B} below conv{O, B, D, C}, and AABC! = conv{B, A, Q, S} 
below conv{B,C, AD}, with the common border BQ € BC". Beneath AB, there 
is mem{B, BA}! with the lower border BA. The right part of parabola mz = m7, 
a<m,< ma, is contained in mem{C, AC}! that borders upon AABC' and 
conv{ B,C, AD} through AS and SP, respectively. 

Finally we examine slices of three parts of conv{B,C, AD} described in 
Lemma 22.9. conv{B,C,AA;} and conv{B,C, A; A2} are separated by QS; if A 
is below the slice. The latter borders upon conv{ B,C, AyD} along QS>. If we go 
down through and below Ag, and Aj, then S;, i = 2,1, are consecutively identical 
with P and then are replaced by some Z; € BA;M ZP, respectively, that move left 
and down so that for sufficiently small €3 they become situated below and right to 
A (see(22.69) and (22.70)). Note that for M = aT + (1 — a)B, formulae (22.72)- 
(22.74) hold with c replaced by b. These decrease in t as well, and therefore, L,(M) 
becomes smaller as we slide M along ZP down and left. 

The above observations are essential for detecting solutions to (22.31) for the case 
V, € AABC! Uconv{B,C, AA;}. Otherwise we could use previous arguments for 
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obtaining (22.71) and concluding either (22.60) or (22.65). Under the assumption, 
we study possible locations of M,¢ for three ranges of levels of horizontal slices. 

If aj} < ms < d’, then there are some moment points in R(E) located above 
Q51 USP. Ones that are below can be eliminated by Lemmas 22.7 and 22.9(i), 
and this will not increase minimal value of L,. If there are any M € R(E) located 
above and right to QS2 and SP, we reduce them using Lemmas 22.8 and 22.9(iii). 
It follows that M,¢ € ebity (EO A A, U SOP. Note that SoP lies above and 
right to A, has a positive direction, and L,. increases as M moves along SP to 
the right. We can therefore, eliminate SP from the further analysis. Referring 
to Lemma 22.9(ii), we assert that M;,¢ is the point of conv{ B,C, A, A3} NRE) 
with the smallest possible m, and mg. For a < m3 < as, we use arguments 
of Lemmas 22.7—22.9 9 again for eliminating all points situated below QS, U SP 
and above QZ» U WePs What remains is conv{B,C, Aj Ao}, and we eventually 
arrive to the conclusion of the previous case. Similar arguments applied in the case 
a? < m3 < a} enable us to assert that M,¢ € conv{ B,C, Ay Ast UZ P. Generally, 
the curve cannot be eliminated here, because Z; may lie left to A, and there exist 
rectangles containing a piece of ZAP and no points of conv{ B,C, Ay Ash See that in 
either part of the union function L, is increasing in m, and mg. mg. Consequently, the 
minimum is attained at the left lower corner of (conv{ B,C, A AcAsy UZP JA R(E). 


Note that the borders of conv{ B,C, A, A; Ao} consist of lines with positive slopes and 


so is that of ZyP. Therefore, in all three cases it makes sense indicating the lowest 
and farthest left point of respective intersections with rectangulars whose sides are 
parallel to m,- and m2-axes. 

Now we are in a position to formulate final conclusions, analyzing possible lo- 
cations of V2. Assume that V2 lies either below or on QS in the case m3 > ae ; 
Then V;V2 and Qj cross each other at the lowest and farthest to the left point 
of conv{ B,C, A, Ap} NR(E), and (22.61) follows. The same conclusion holds if V2 


lies either below or on QZ, in the case m3 < a If V2 is below Ze that is only 
possible when Z; lies left to A for some m3 < a?, then (22.62) holds, because R(E) 
does not contain any points of conv{B, C, Ay Ad}, and one that is the farthest left 
element of VR is that of intersection with V, V2. 

Assuming that V2 € conv{B, CAVA which is possible for Q lying below A, 
we obtain (22.67). Indeed, V2 is the left lower vertex of R(E), and so this is also 
the left farthest and lowest point of the respective intersection with the domain 


of arguments minimizing L, for every level between a? and d?. For m3 > a3, it 
suffices to consider V2 lying right to @S2. Like in the other cases considered below, 
this is only possible when Q is located below A. It is clear to see that the lowest 
and farthest left point of conv{ B,C, Aj Ast NR(E) is that at which V2V3 and QS 
intersect each other. If m3 < a3 and V9 lies right to QZ2, the same conclusion holds 
with Sj replaced by Z. Since QS2,QZ2 C ABA2C, we can summarize both the 
cases by writing (22.63). 
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The last possible solution (22.64) to (22.31) occurs when m3 < a3 and V2 lies 
right to ZP. In fact, the points M € ZP with m2 < a? are of interest. If 
a? < mg < a8, then ZP is a part of the left border of conv{ B,C, Ay Ao} and the 
other QZ is located below the level of V2V3. T herefore, LZ, is minimized at the 
intersection of Ze and V2V3. If m3 < a} and Z, lies below A, then it is also 
possible that Mre € Z)P. R(E). 


22.5 Conclusions 


Proof of Theorem 22.1. This relies on the results of Theorem 22.10. We consider 
various forms (22.60)—(22.67) of moment points solving (22.31) and indicate ones 
that satisfy (22.32) for some r < a*. Solutions to (22.32) with respect to r provide 
values of Prokhorov radius for various combinations of €;, 1 = 1,2,3. However, 
only (22.60) enables us to determine an explicit formula for the radius. In this 
case, evaluating (22.37) at Vi = (a — €1,a7 + €2,a3 + €3), we rewrite (22.32) as 
1 — (2ae, + €2)/r? = 1-7, and easily determine (22.5). Relation (22.3) expresses 
analytically the fact that V; lies on the same side of the vertical plane mz — (b+ 
c)m, + bc = 0 containing BO! as BC does. Condition (22.4) is equivalent to 
locating Vi above pl{ ABC} (cf. (22.50)). Both, combined with positivity of all €;, 
ensure that V; € AABCT. 

If Mpe € ViV2, then M,ge = (a — €1,m2,a? + €3) for some |mz — a?| < eg. 
Conditions (22.32) and (22.61) enforce the representation 


Mre = (1—r)Ai + ar(C — B)+rB, 


for some 0 < a < 1, that gives 


(l—r) (« + 7) an 2ar?+r(a—r)=a—e1, (22.75) 
2\ 3/2 
(1 —r) G + 5) + 2ar? (3a* +17) +r(a—r)? =a? +63, (22.76) 
in particular. By (22.75), 
2ar? =a-—r(a—r) —(1—-7r) (« =) ee e, € [0, 2r?] (22.77) 


enables us to describe the range of €, by (22.7). Plugging (22.77) into (22.76), we 
obtain (22.6). Furthermore, we apply 


2 
(l—-r) (« + 7) + daar? + r(a—r)? = ma, 


combine it with (22.77) and |m2 — a?| < eg, and eventually obtain (22.8). 
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Under (22.62), Mpe = (a — €1, m2, a° + €3) fulfills 


(l-r)t+r(a-r)=a-4, (22.78) 
(l—r)t??+r(a—r)? =m, (22.79) 
(l=r)e +r(a—r)? =a° + 3, (22.80) 
for some 
gearir 
a<t< (« ~ 7) : (22.81) 
|m2 —a?| < e2. (22.82) 


We determine ¢ from (22.78) and substitute it into (22.79)- (22.81). Then the 
first one combined with (22.82) yields (22.11). Condition (22.80) provides (22.9) 
defining a relation among €1, €3, and the Prokhorov radius. The last one coincides 
with (22.10). 

Cases (iv) and (v) can be handled in much the same way as (ii) and (iii), re- 
spectively. For the former, we use the relations 


27% ; 
(=r) las = +ar(é—-b)+rbh=m, i=1,2,3, (22.83) 


with 0<a<1,m, >a—«, m2 = a? — €&, and m3 = a® + €3. In the latter, we 
put the same m;, 7 = 1, 2,3, in the equations 


(l—r)t?+rb' = mi, i= 1,2,3, (22.84) 


with a <t<a+r?/(3a). Here we use the second equations of (22.83) and (22.84) 
for determining parameters a and t, respectively. Geometric investigations carried 
out in the proof of Theorem 22.10 make it possible to deduce that mz < a? implies 
my, <a for both (22.83) and (22.84). Therefore, M;¢ is actually located on the left 
half of V2V3. 

For the proof of last two statements we need a formula defining r in (22.31) for 
M,e € conv{B,C, AA}. Direct application of (22.45) leads to more complicated 
one than that derived from equations 


2ar? + (1—r)t+r(a—r) =m, (22.85) 
Aor?a+(1—r)t?+r(a—r)? =m, (22.86) 
2ar? (3a? +r?) + (1—r)t? +r(a—r)? =ms (22.87) 


for 0 < a < landt € [a,a+1r?/(3a)]. Subtracting (22.85) multiplied by 2a 
from (22.86) and adding (1 — r)a?, we find 


(1 —r)(t— a)? = ma — 2am + a? — 1’. 
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Since t > a, we can write it as 


=9 aa 3 1/2 
ad ee (22.88) 
ame 6s 


say. Using the formula and a representation of 2ar? derived from (22.85), we can 
transform (22.87) as follows: 


m3 = (3a? +r?) [my —a+r? —(1—r)t] +(1—r)(at+7)? 


= (3a? + r”) mi — (30° + iat (a— 1) +(1=—rja? —(1—r)r?r 
+(3a +7) (m2 — 2am, + a= r°) 


= 3am — (3a* — r*) my + 4° — ar® + (mz — 2am, + a® — r*) 7. 


Referring again to (22.88), we ultimately derive 


—ms3 + 3amz — (3a? — r?) m, +43 — ar? = (= — 2am, + a? — “) We (22.89) 


—meg + 2am, — a? +r? l-r 


Observe that for all M € conv{B,C, AD}, the denominator of the left-hand side is 
positive and so is the numerator. If (22.65) holds, we can further simplify (22.89) 
and conclude (22.18). For asserting that Vi € conv{ B,C, AA;}, it suffices to as- 
sume that it is located in the halfspace m2 — 2am, + a? — r? < 0, above pl{ BA,C} 
and below pl{ ABC}. The first requirement coincides with (22.19), and the oth- 
ers, by (22.50), represent conditions (22.20) on the range of ¢3. If Mre = Va, 
then (22.89) implies (22.21). Condition Vz € conv{ B, C, A; Ay} can be equivalently 
expressed by saying that V2 lies right to B! between the planes containing BO" 
BA, ABA2C and ABA,C. Analytic descriptions of the conditions are presented 
in (22.22) and (22.23). This completes the proof of Theorem 22.1. 


It is of interest if all representations of Prokhorov radius given in Theorem 22.1 
are possible for arbitrary a > 0 assuming various shapes of moment boxes B(€). 
Applying statements of Theorem 22.10, we assert that for all 0 < r < a* there 
are €;, 7 = 1,2,3, such that II(€) = r in each of Cases (i), (ii), (iv), (vi), and 
(vii). It follows from the fact that there are moment points M that belong to any 
of AABC', ABA,;C, conv{B, C, AA;}, i = 1,2, which are convex combinations of 
points situated above and right to A. Therefore, for every 0 < r < a* it is possible to 
construct a box such that M = M,¢ satisfies conditions of (22.60), (22.61), (22.63), 
(22.65), or (22.66). 

The remaining two cases need a more thorough treatment. Formulae (22.9) 
and (22.15) can be used in describing the Prokhorov radius of moment neighbor- 
hoods of some 6, iff (1—r)A;+rB, i= 1, 2, respectively, lie above the level m3 = a? 
for some 0 < r < a*. The conditions coincide with positivity of 


2 


3i/2 
ar) ==) fp + r(1-Z)-4, O<r<a*, i=1,2. (22.90) 
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We can check that 6; < (2, 6;(0) = 6/(0) = 0 for all a > 0, and 6/’(0) > 0 iff 
a < i/6, i = 1,2. Accordingly, (22.9) and (22.15) are applicable in evaluating 
convergence rates for a < 1/6 and 1/3, respectively. A numerical analysis shows 
that these are sufficient conditions for positivity of (22.90) on the whole domains. 
On the other hand, (;, i = 1,2, are nonpositive everywhere if a > 1 — (3/4)*/? & 
either 0.3505 or 0.5781, respectively. For intermediate a, the functions are negative 
about 0 and positive about a* = a, which means that (22.9), (22.15) are of use for 
sufficiently large boxes. For example, for a = 0.25, 3, changes the sign at r + 0.1196 
(G2 > 0), and so does 82 at r © 0.1907 (G1 < 0) for a = 0.45. If a = 0.34, then 
r, © 0.3162 and rg + 0.01004 are the roots of 3;, 7 = 1,2. 
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Chapter 23 


Geometric Moment Methods Applied to 
Optimal Portfolio 
Management 


In this chapter we start with the brief description of the basics of Geometric Moment 
theory method for Optimization of integrals due to Kemperman [229], [232], [234]. 
Then we give solutions to several new Moment problems with applications to Stock 
Market and Financial Mathematics. That is we give methods for optimal allocation 
of funds over stocks and bonds at maximum return. More precisely we describe 
here the Optimal Portfolio Management under Optimal selection of securities so to 
maximize profit. The above are done within the models of Optimal frontier and 
Optimizing concavity. This treatment follows [44]. 


23.1 Introduction 


The main problem we deal with and solve here is: the optimal allocation of funds 
over stocks and bonds and at the same time given certain level of expectation, best 
choice of securities on the purpose to maximize return. The results are very general 
so that they stand by themselves as “formulas” to treat other similar stochastic 
situations and structures far away from the stock market and financial mathe- 
matics. The answers to the above described problem are given under two mod- 
els of investing, the Optimal frontier and Optimizing concavity, as being the most 
natural. 

There are presented many examples all motivated from financial mathematics 
and of course fitting and working well there. The method of proof derives from 
the Geometric Moment theory of Kemperman, see [229], [232], [234], and several 
new Moment results of very general nature are given here. We start the chap- 
ter with basic Geometric Moment review and we show the proving tool we use 
next repeatedly. 

The continuation of this material will be one to derive algorithms out of this 
theory and create computer software of implementation and work with actual nu- 
merical data of the Stock Market. 
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23.2 Preliminaries 


Geometric Moment Theory (see [229], [232], [234], of J. Kemperman). Let 
Ji,-++;9n and h be given real-valued (6M A) (Borel and A)-measurable functions 
on a fixed measurable space T = (T,.A). We also suppose that all 1-point sets {t}, 
t € T are (6M A)-measurable. We would like to find the best upper and lower 
bounds on the integral 


lh) = f reyatat) 
given that zis a (with respect to A) probability measure on T with given moments 


Mo)= uj Jaden 
We denote by m+ = m*(T) the collection of all A-probability measures on T such 
that u(|g;|) < oo (fj =1,...,n) and p(|h]) < 00. For each y = (y1,.-.,Yn) € R”, 
consider the bounds L(y) = L(y | h) = inf w(h), U(y) = U(y | h) = sup p(h), such 
that 
wem*(T); wgi)=yj, Jal... 
If there is not such measure pz we set L(y) = co, U(y) = —o0. Let M*(T) be the set 


of all probability measures on T that are finitely supported. By the next Theorem 
23.1 we find that 


L(y | h) = inf{u(h): we M*(T), u(g) = y}, (23.1) 
and 
U(y | h) = sup{u(h): w € M*(L), u(y) = y}. (23.2) 
Here ju(g) = y means u(g;) = yj, j=1,...,n. 
Theorem 23.1. (Richter [307], Rogosinsky [310], Mulholland and Rogers [269]). 
Let fi,..., fw be given real-valued Borel measurable functions on a measurable space 
Q = (Q,F). Let pw be a probability measure on Q such that each f; is integrable with 
respect to yw. Then there exists a probability measure pi’ of finite support on Q 
satisfying 
(fj) = u(f;) for all j=1,...,N. 
One can even attain that the support of p' has at most N +1 points. 
Hence from now on we deal only with finitely supported probability measures 
on T. Consequently our initial problem is restated as follows. 
Let T 4 0 set and g: T > R", h: T > R be given (BN A)-measurable functions 
on T’, where g(t) = (gi(t),..-,9n(t)). We want to find L(y | h) and U(y | h) defined 


by (23.1) and (23.2). 
Here a very important set is 


V = conv g(T) CR”, 
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where “conv” means convex hull, and the range 
g(T) = {2 € R": z = g(t) for some t € T}. 


Clearly g is a curve in n-space (if T is an one-dimensional interval) or a two- 
dimensional surface in n-space (if T is a square). 


Lemma 23.2. We have y € V iff there exists up € M*(T) such that p(g) = y. 
Hence, by (23.1), 


Liy|h)<oco iff yeV. 
Similarly, from (23.2), 
U(y|h) > —-co iff ye V. 


We get easily that the interior of V, int(V) 40 iff 1, g1,.--,9n are linearly inde- 
pendent on T. 


So without loss of generality we suppose that {1,91,..-.,gn} is a linearly in- 
dependent set of functions on JT, hence int(V) #4 @. Also clearly we have 
U(y | hk) = —L(y | —h), and L(y | h) is a convex function for y € V, while 


U(y | h) is a concave function for y € V. We consider V* = conv g*(T) C R®*?, 
where g*(t) = (gi(t),...,gn(t),h(t)), t € T. By Lemma 23.2 we have that 
(y1,---,Yns Ynt1) € V* iff there exists p € M*(T) such that p(g;) = yj, 7 =1,...,n 
and u(h) = Yn4i. Clearly L(y | h) equals to yn+1 € RU {—co} which corresponds 
to the lowest point (in the infimum sense) of V* along the vertical line through 
Crecpeere rin 

Similarly U(y | h) equals to 7,,,,; € RU {oo} which corresponds to the highest 
point (in the supremum sense) of V* along the vertical line through (y1,..., Yn, 0). 
Clearly the above optimal points of V* could be boundary points of V* since V* is 
not always closed. 


Example. Let T be a subset of R” and g(t) = ¢ for all t € T. Then the graphs of 
the functions L(y | h) and U(y | h) on V = conv(T) correspond to the bottom part 
and top part, respectively, of the convex hull of the graph h: T — R. 


The analytic method (based on the geometry of moment space) to evaluate 
L(y | h) and U(y | h) is highly more difficult and complicated and not mentioned 
here, see [229]. But it is necessary to use it when it is much more difficult to describe 
convex hulls in R”, a quite common fact for n > 4. Still in R® it is already difficult 
but possible to describe precisely the convex hulls. Typically the convex hull V* 
decomposes into many different regions, each with its own analytic formulae for 
L(y | h) and U(y | h). 

So the application of the above described geometric moment theory method is 
preferable for solving moment problems when only one or at the most two moment 
conditions are prescribed. Here we only use the geometric moment theory method. 
The derived results are elegant, very precise and simple, and the proving method 
relies on the basic geometric properties of related figures. 
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Claim: We can use in the same way and equally the expectations of random 
variables (r.v.) defined on a non-atomic probability space or the associated integrals 
against probability measures over a real domain for the standard moment problem 
defined below. 

Let (0,7, P) be a nonatomic probability space. Let X be r.v.’s on o taking 
values (a.s.) in a finite or infinite interval of R which we call it a “real domain” 
denoted by 7. Here EF denotes the expectation. We want to find inf E(h(X)), and 


sup E(h(X)), such that 
xX 
Eg(X) = di, t= 1,605} d; ER, (23.3) 


where h, g; are Borel measurable functions from JZ into R with E(|h(X))), 
E(\gi(X)|) < oo. Clearly here h(X), g;(X) are F-measurable real valued func- 
tions, that is r.v.’s on o. Naturally each X has a probability distribution function 
F which corresponds uniquely to a Borel probability measure pu, ie., Law(X) = wp. 
Hence 


Eg(X) =f gle) arte) = | gio) due) = di, i= Aen 
T TI 
and 
Eh(X) = ih h(x) dF (2) = i, h(a) du(a). (23.4) 
J J 
Next consider the related moment problem in the integral form: calculate 
int f(t) du(t), sup f A(t) du(t, 
BSS BOSS 
where yp are all the Borel probability measures such that 
fi oQau)=d, a= 1,040, (23.5) 
J 


where h, g; are Borel measurable functions such that 


/ In(t)| du(t), | la:(2)| dyi(t) < 00. 
J J 


We state 
Theorem 23.3. Jt holds 
inf ER(X) = inf pm du(t). 23.6 
sup ( ) (sup) all yz as in (23.5), J ( ) Hal ) ( ) 
all X as in (23.3) Borel probability measures 


Proof. Easy. 


Note. One can easily and similarly see that the equivalence of moment problems 
is also valid either these are written in expectations form involving functions of 
random vectors or these are written in integral form involving multidimensional 
functions and products of probability measures over real domains of R*, k > 1. Of 
course again the underlied probability space should be non-atomic. 
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23.3 Main Results 


Part I. 
We give the following moment result to be used a lot in our work. 


Theorem 23.4. Let (0,F,P) be a non-atomic probability space. Let X, Y be 

random variables ono taking values (a.s.) in [—a, a] and |—6, 3], respectively, a, 3 > 

0. Denote 7 := E(XY). We would like to find sup 4, inf 7 such that the moments 
X.Y ; 


EX =jn, BY Sry ja €[—a,al, “4 € [|-8, 6] 


are given. 
Consider the triangles 


T, = ({(-a, — 6), (a, —9), (a, 8)}), 
Tz = ({(—a, — 6), (—a, 8), (a, 8)}), 
Ts = ({(a, —9), (a, 8); (—a, B)}); 

Ta = ({(a, —9), (—a, 8), (—a, —8)}) 


Then it holds (Here \,p,p >0:A\+p+yp=1.) 
I) Of (41,11) ETL Th, te. py = a(—A+ p), 1 = B(y — 1), then 


sup y = —Gu1 + a, + af, 

X,Y 

int = —Bu — av, — af. (23.7) 
YW) If (w1,11) € TN T3, we. pr =a(1 — ¢), 11 = B(-A + p), then 

sup y= —Sp1 + av, + af, 

X,Y 

ty = By, + av, — af. (23.8) 
III) If (41,11) € T2N Ts, i.e. wy =a(A— p), 1 = BIL — 9), then 

sup Y = Sy — a + af, 

X,Y 

ant = Buy + ar, — af. (23.9) 
IV) If (41,1) € ToNTh, te. wy = aly —1), 4 = B(A— p), then 

sup y = Bur — av, + a8, 

X,Y 

pe = —Bu, — av, — af. (23.10) 

Proof. Here y := EXY, such that 


—a<X <a, —-B<Y <8, a, 3 > 0, 
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and the underlied probability space is non-atomic. That is, 


y= | xy du, (23.11) 
[—a,a] x[-8,6] 


where p is a probability measure on [—a, a] x [—{, 3]. We suppose here that 


/ cdp = 1, / yd =, (23.12) 
[-a,a]x[—6,6] [-a,a]x[—6,6] 


where pi € [—a,a], 1 € [—G,G]. Clearly we must have always —a < py < a, 
ES ys 

Here we consider and study the surface S$: z = zy, |z| < a, |y| < 6. We 
consider also the tetrahedron V with vertices A := (a, 3,aG), B := (—a,—,a{8), 
C := (a,—8, —aZ), D = (—a, 8, —aB). Notice (0,0,0) € V and we establish that V 
is the convex hull of the surface S. 

Clearly we have 


1) Line segment (£1) through points B and C: 


x= —a-+ 2at 
y=—6 , O<t<l. (23.13) 
z= a —2aft 


Notice that xy = z so that (¢) is on surface S. 
2) Line segment (¢2) through points A, C: 
r=a 
y=—-B+268t $, O<tK<1. (23.14) 
z= —af8 + 2aBt 
Notice that zy = z so that (£2) is on surface S. 
3) Line segment (¢3) through points A, D: 
x= —a-+2at 
y= , O<t<1. (23.15) 
z= —afB + 2abt 
Notice that zy = z so that (¢3) is on surface S. 


4) Line segment (¢4) through points B, D: 


r=-a 
y=—-B+26t p, O<t<1. (23.16) 
z= a —2aft 


Notice that xy = z so that (€4) is on surface S. 
We consider also the following line segments. 
5) Line segment (¢5) through the points C, D: 
xz =a-— 2at 
y=—-B+26t, O0<t<1. (23.17) 
z=-—a 
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6) Line segment (¢¢) through the points A, B: 


x=a-—2at 
y=G8—26t >, O<t<1. (23.18) 
z=a 


A 
Therefore the surface of V consists on the top by the triangles: (A B D) and 


(A B C), and in the bottom by the triangles (B C D) and (A C D). 

In one case we have (t1,/1) € triangle T; := ({(—a,—/(), (a,—(), (a, 8)}), 
and in another case (1,1) € triangle T> := ({(—a, —8), (—a, 3), (a,)}). Thus 
(41,1) € Ty iff FA, p,p > 0: A+ p+y =1 with pw, = a(1—2A) and 1, = G(2y—1). 
Also (u1,1) € To iff SA*, p*, p* > 0: A* + p* + y* = 1 with py = a(2y* — 1) and 
vy = B(1 — 2A*). 


Description of the faces of V: 
1) Plane P; through A, B, C' has equation 


z=—Bxr+aytaB, (23.19) 


and it corresponds to triangle 7}. 
2) Plane P; through A, B, D has equation 


z= Ppx—ayt+ap, (23.20) 
and it corresponds to triangle T>. 
3) Plane P3 through A, C, D has equation 
z= Bbx+ay-—aB (23.21) 
and it corresponds to triangle T3. Here 
Ts := ({(a, —8), (a, 8), (—a, B)}). 
In fact (41,71) € Ts iff SA, p,y > 0: A+ p+y=1 with 
fy =a(l—2y), m= BU -2)). 
4) Plane P; through B, C, D has equation 
z= —Px —ay—af (23.22) 


and it corresponds to triangle T,. Here 
T4 = ({(a, B), ( a, 3), ( a, )}). 
In fact (t1,11) € T4 iff SA*, p*, p* > 0: A* + p* + y* = 1 with 
fy = a(2\* — 1) and 4 = B(2p* — 1). 


The justification of V is clear. 
Conclusion. We have seen that the tetrahedron V is the convex hull of surface 
S: z = xy over [—a,a] x [—G, 6]. Call O := (0,0), Ai := (a,8), Ag := (—a, 8), 
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Ag := (—a,—3), Aa := (a, —(). Clearly (A, A2A3 Az) is an orthogonal parallelogram 
with sides of lengths 2a and 2(. 
We notice the following for the triangles 


(23.23) 
That is 
A A 
TiN Ty = (A3 AsO), T,NT3 = (As O Ai), 
A A 
T2NT3 = (A1 O Az), T2NTs = (A3 O Ap). (23.24) 
Also we see the following: 


A 
1) Let (a,y) € (Ag Aa O) iff (2, y) € TN Ty iff e = a(-A +p), y = B(Y—1), 
where 


App 20: A+pt+top=1. 
A 
2) Let (a,y) € (As O A1) iff (2, y) € TT iff x = a(1 — ¢), y = B(-A + p), 
where 
Asp,p 20: At+pt+typ=1. 
A 
3) Let (x,y) € (Ai Az O) iff (x,y) € T2 Ts iff e = a(A— p), y = G(1 —y), where 
A,p,9 20:A+p+yp=l1. 


A 
4) Let (x,y) € (Ag As O) iff (a, y) € T2N Ts iff x = a(y—1), y = B(A—p), where 
App 20:A+p+y=1. 


Finally applying the geometric moment theory method presented in Section 23.2 
we derive 


I) Let (141,11) ET, NT), then 


su pf xy du = —Bu, + ar, + af, 
[-a,a] x [-6,] 


in aff cy du = —Byy — av, — af. (23.25) 
[-a,a]x [8,6] 
II) Let (41,11) € T, 1 T3, then 


sup | xy du = —By, + ar, + a8 
[—a,a] x[—6,6] 


int [ zy du = By, + av, — af. (23.26) 
MK J[-a,a]x[-6,8] 
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III) Let (41,11) € T2973, then 


sup | ry du = Bu, — ar + af, 
uw J[-a,a]x[-B,6] 


ant cy du = Bur + av, — af. (23.27) 
a,a] x[—8,6] 
IV) Let (H41, V1) € T2NT4, then 


su wp ft ry du = Buy — av, + a8 
[-a,a]x [-,6] 


in tf cy du = —Byy — av, — af. (23.28) 
a,a]x[—6,6] 


23.3.1. The Financial Portfolio Problem 

Let the r.v.’s X giving the annual return of one risky asset (say a stock) and 
Y giving the annual return of another risky asset (say a corporate bond) such that 
EX = 1, EX? = po, EY =, EY? = v2 are known, and EXY = y may not be 
known. Here X,Y are not necessarily independent. 

Denote by R the return of a portfolio of the stock and bond R= wX+(1—-w)Y, 
where w is the fraction of funds allocated to the stock. Here we would like to find 
the optimal w* such that the utility function 


u(R) = ER—2AVar R (23.29) 


is maximized, where > 0 is the risk-avertion parameter which is fixed. 

The above is the optimal frontier problem. Notice that X,Y are not necessarily 
normal random variables, typically they have variances much larger than means. 
So the not short selling case is 


maxu(R):0<w< 1. (23.30) 


In general may be £ < w < u, —oo < £ <u < oo. If & < 0, it means one can sell 
the stock and put it all in the bond. 
In Part II of the chapter we deal in the same manner with another utility function 


u(R) = Eg(R), (23.31) 


where g(x) is a strictly concave function. This is the Optimizing concavity problem. 
However typically we deal with many random variables X;, Y;,7 = 1,...,n that are 
the annual returns of n stocks and bonds, respectively. Thus in general in portfolio 


N 
we have the return R, = >> w;X;i, where X; is the annual return from stock i or 
i=l 
bond i. 
Therefore we need to calculate 


max u(R,), such that u<w,<@;,, 7=1,...,N, (23:32) 
WN 


Losey 
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and u,;, &; are fixed known limits. To maximize u(R,) is not yet known even if i = 2 
and 0 < w <1. We present these for the first time here. 
Typically here the distributions of X; are not known. We work first the special 
important case of 
R=wX4+(1l-w)Y, O<w<1 or é<w<u, (23.33) 
and 
u(R) = ER—2Var R, A> 0. (23.34) 
One can easily observe that 
u(R) = wEX + (1—w)EY — (E(R?) — (ER)’) 
= wi + (1— wm — A{w? BX? + (1—w)? EY? + 2u(1 — w) EXY 


w pt — (1 — w)?v7 — Qw(1 — w) rr}. 


So after calculations we find that 


f(w) := u(R) = Tw? + Mw K, (23.35) 
where A > 0, 
T := —)ptg — Ave + Dy + Au? + Av; — Wu, (23.36) 
M::= jy + Qdryg — 27 — 2Av7 + Dy, (23.37) 
and 
K = v1, — Avg + v7. (23.38) 


To have maximum for f(w) we need T < 0. 
To find max u(R) is very difficult and not as important as the following modifi- 
wy 


cations of the problem. More precisely we would like to calculate 


max (1 WR) : (23.39) 
w y 
the best allocation with the best possible dependence between assets, and 
max (sin WR) ; (23.40) 
w y 


the best allocation under the worst possible dependence between the assets. Accord- 
ing to Theorem 23.4 we obtain 


Yinf < i < Ysup> ie. — Ysup < —7 < —YVinf- (23.41) 


So in the last modified problem 7 could be known and fixed, or unknown and variable 
between these bounds. Hence for fixed y we find max u(R), and for variable 7, using 
Theorem 23.4, we find tight upper and lower bounds for it, both cases are being 
treated in a similar manner. 


Geometric Moment Methods Applied to Optimal Portfolio Management 311 


Denote 
A(w) := (—Ap2 — Ava + ApT + AVP — 2p) w? 
+ (py — Vy + 2Avg — 2AV? + QAp)w + (Vy — Avo + AVZ). (23.42) 
Therefore from (23.35) we obtain 


u(R) = A(w) + 2Aw(w — 1)7. (23.43) 
In case of 0 < w < 1 we have 
min u(R) = A(w) + 2Aw(w — 1) (max 7) ; (23.44) 
Le. 
min u(R) = A(w) + 2Aw(1 — w) ( (sup )) ; (23.45) 
me inf 


y 


Using Theorem 23.4 in the last case (23.45) we find tight upper and lower bounds 
for max u(R). 


Proposition 23.5. Let y be fixed and w € [0,1] or w € [£,u], -co < £cu<om. 
Suppose T <0 and —4f € [0,1] or -H€ [é,u]. Then the maximum in (23.35) is 


M —M? +4TK 
eee = ——____. 23.4 
r( *) max u(R) TT (23.46) 
Eig. X=1, y= 6, po = 5, = 10, wy = 2, 4 = 3. Then T= -2 <0, M =1, 
K =2, —3£ =0.25 € [0,1], and maxu(R) = 4. 


In another example we can take 
A=1, wi = -10, 4) = —20, wo = 4, 12 = 6, y = 2, 
then T = 94 > 0. 


Proposition 23.6. Let y be fired and w € [0,1] or w € [€,u], -co < Lc u<om. 
Here T > 0 or T < 0 and —+t ¢ [0,1] or T < 0 and —#£ ¢ [,u]. Then the 
maximum in (23.85) is 


max u(R) = max{ (0), f(1)} = max{ KAGE 4a) yn}, (23.47) 
and 
Rae u(R) = max{ f(u), f(@}. (23.48) 


Application of y-moment problem (Theorem 23.4) and Propositions 23.5 
and 23.6. Let 0 < w <1, from (23.43) we find 


u(R) = A(w) + 2Aw(1 — w)(—9), (23.49) 
where y = EXY and A(w) as in (23.42). Then 
u(R) < A(w) + 2Aw(1 — w)(— ying) =: O(w), (23.50) 
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and 
u(R) > A(w) + 2Aw(1 — w)(—Ysup) =: L(w). (23.51) 
Hence one can maximize 0(w) and L(w) in w using Propositions 23.5, 23.6. That 


is finding tight estimates for the crucial quantities (23.39) and (23.40). That is, we 
obtain 


R))< 6(w), 23.52 

ama, (maxcu( )) < max O(w) (23.52) 
and 

i R))> L : 23.53 

max, (nin )) > max, E(w) (23.53) 


Notice that in expressing u(R) we use all moments p44, 1, 2, V2. 


23.3.2. The Portfolio Problem Involving Three Securities (similarly one 
can treat the problem of more than three securities). 

Let X1, X2, X3 be r.v.’s giving the annual return of three risky assets (stocks 
or bonds). 

Let R:= wi X1+w2X2o+w3X3, 0) <w; < uz, i= 1,2,3 the return of a portfolio, 
and the utility function u(R) = ER — Var R, \ > 0 the risk-avertion parameter 
being fixed. Again we would like to find the optimal (w1, we, w3) such that u(R) is 
maximized. Clearly 


u(R) = wi EX, + w2EX2 + w3EX3 — A(E(R?) — (ER)’). 
Set pug = EX), pa = EX?, i =1,2,3. We have pu; < ps!?, i = 1,2,3. Hence we 
obtain 
u(R) = wifi + woftie + w3p13 A[wi EX? + weEX3 + wEX? 
+ 2wywoE X1 Xo + 2wow3Ek X2X3 + 2w3w, Bb X3X1 we pry we pr, we is 


— 2wywepripi2 — 2wewsp2113 — 2wiwspaipas]- 
Call N= EX, Xa, 12 -= EX2X3, 3° EX3X, then 


1/2 1/2 1/2 1/2 
vn < pf eee 2 < po tee 


Therefore it holds 


1/2. 1/2 
HeS ay Bon 


u(R) = wi pri + weptie + w3h13 wy Apar a ws Ap22 ame w3 p23 

— 2wiwery1 — 2wew3Ay2 — 2w3wirAy3 + wi AUT + Wert + WEALZs 
+ 2wy weALi1 {12 + 2wew3AL12 113 ae 2wyw3AL11 M13. 

Consequently we have 

g(w1, w2, ws) = UR) = (—Apar + Aui wT + (—Api2a + Aun) w3 

(—Apio3 + Apis) wz + (Hi1wi + pi2we + f13w3) 

(—2A 71 + 2A pia) wi we + (—2Ay2 + 2ZAp12/113) Wows 

+ (—2A93 + 2Api1 p13) Ww. (23.54) 
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Here we would like first to study about the absolute maximum of g over R°. For 
that we set the first partials of g equal to zero, etc. Namely we have 
Dyg(wi, we, w3) = (—2Apar + 2Auz,)wi + par 
+ (—2A 91 + 2Api1 iz) we + (—2AY3 + 2Apiipi13)w3 = 0, 
(23.55) 
Dog(w1, w2, wz) = (—2Api22 + 2Apty)we + pr2 
+ (—2A 91 + 2Apii iz) wi + (—2AY2 + 2Api2p13)wW3 = 0, 
(23.56) 


and 


D3g(wi, W2, w3) = (—2Apa3 + 2Ap23)wW3 + M13 
+ (—2A72 + 2A p12 [13 ) We + (—2A73 + 211413) W1 = 0. 


(23.57) 
Furthermore we obtain 
Dirg(wi, w2, w3) = (—2Apo1 + 2Au7,), (23.58) 
Dogg(w1, w2, w3) = (—2Ape2 + 2Au70), (23.59) 
and 
D339(w1, We, w3) = (—2Apo3 + 2Au73). (23.60) 


Also we have 


Dizg(wi, We, W3) = Daig(wr, W2; ws) = (—2A71 + 2p11 1412), (23.61) 
D31g(wy, W2, w3) = Di39(w1, we, w3) = (—2Ay3 + 2Apiif13), (23.62) 


and 
Do3g(wi, W2, wW3) = D32g(wi, we, w3) = (—2Ay2 + 2Api2/13). (23.63) 


Next we form the determinant of second partials of g 


2X p21 t DNpi2 5 271 t 2A p11 L125 —2\73 + 2A p11 113 
A3(w) = 271 fo 2A p11 L12; —2Ap122 + Dp. —2\72 + 2A p12 113 5 (23.64) 
—2dy3 + 2Apri pig, —2A72 + 2ZApi2H13, —2Ape3 + 2Auts 


where w := (wi, We, w3) € R®. 
One can have true A3(w) 4 0. So we can solve the linear system over R? 


(—2Apor + 2Au2, wr + (—2\y1 + 2A pie) we 
(—2y3 + 2Ap11H13)W3 = —H11, 

(—2Ay1 + 2Apaipaz)wi + (—2Ape22 + 2Apity)we 
(—2Ay2 + 2Api2H13)wW3 = —}12, 

(—2ry3 + 2Apiipi3)w1 + (—2Ay72 + 2Api2f13)we2 
+ (—2Apia3 + 2Au73)w3 = —p13. 


(23.65) 
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Here A3(w) is the basic coefficient matrix of system (23.65). 

Given that As(w) #4 0 the system (23.65) has a unique solution w* := 
(wi, w5, wi) € R°. That is, w* is a critical point for g. 

Assuming that g is a concave function (it is also differentiable) then it has an 
absolute marimum at w* over R3 (see [196], p. 116, Theorem 3.7 and comments of 


3 
“Functions of several variables” by Wendell Fleming). If w* € [][é:, ui] then the 
i=l 


global maximum of g there will be g(w*). 


Note. In general let Q(x,h) := S2 gij(a)hihj, where g € C?)(K), K-open convex 
ij=l 


CR”, h:= (hy,..., An), © = (X1,.--,2n). 
We need 


Theorem 23.7 (p. 114, [196], W. Fleming). 


(a) g is conver on K iff Q(x,-) > 0 (positive semidefinite), Vx € K, 

(b) g is concave on K iff Q(a,-) <0 (negative semidefinite), Va € K, 

(c) if Q(z,-) > 0 (positive definite), Vx € K, then g is strictly conver on K, 
(d) if Q(a,-) <0 (negative definite), Vx € K, then g is strictly concave on K. 


Let A(x) := det[g;;(x)] and A,-;,(x) denote the determinant with n — k rows, 
obtained by deleting the last k rows and columns of A(x). Put Ao(x) = 1. For our 
particular function (23.54) we do have 


g(t) = g(x), 2,7 =1,2,3. 


That is Q(a#,-) is a symmetric form. From the theory of quadratic forms we 
have equivalently that Q(a,-) is positive definite iff the (n + 1) numbers Ao(z), 
Ai(a),...,An(x) > 0. Also Q(a, -) is negative definite iff the above (n+ 1) numbers 
alternate in signs from positive to negative. 

In our problem we have 


Ao(w) = 1, (23.66) 

Ai(w) = —2Apor + 20u7,, (23.67) 
oie pe 1 

A>(w) 2 {21 My V1 are (23.68) 
—2Ay1 + 2Apii M2, —2Ap22 + 2Ap{2 


and A3(w) as before, all independent of any w € R%. We can have examples of 
Ai(w) < 0, As(w) > 0, As(w) < 0, Vw € R%, see next. Therefore in that case 
Q(w,-) is negative definite, ic. Q(w,-) <0, Vw € R’. Thus (by Theorem 3.6, [196]) 
g is strictly concave in R°. 


Note. Let f be a (strictly) concave function from R into R. The easily one can 
prove that g(wi, we, w3) := f(wi) + f(we) + f(ws) is (strictly) concave function 
from R? into R. 
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Example. Clearly f(x) := —0.75x? + 0.52 is strictly concave. Hence 
g(wi, we, w3) = (—0.75)(w? + we + w3) + (0.5)(wi + we + ws) (23.69) 


is strictly concave too. This function g is related to the following setting. 

Take A= 1). jar = pag = pig = 1/2, par = pos = fog = 1, 9s = 2 = 3 = 1/4 
Then Aog(w) = 1, Ai(w) = —1.5 < 0, Ao(w) = 2.25 > 0, As(w) = —3.375 < 0. The 
linear system (23.65) reduces to —1.5w; = —0.5, 7 = 1,2,3. Then w* = w; = 0.33, 
i= 1,2,3. That is, (0.33, 0.33, 0.33) is the critical number of g as above (23.69). 
Hence 


max |(—0.75)(w? + w5 + w3) + (0.5)(wi + we + w3)] = 0.249975, 
and wy € [—1,1] or in [0,1], i = 1, 2,3, etc. 
Comment. By Maximum and Minimum value theorem since y := Il [0;, Us], 
;,u; € R is a compact subset of R® and since g(w) = u(R) is a asians real 
valued function over y, then there are points w* and w, in y such that 
g(w") = sup{g(w): w € x}, 
g(ws) = inf{g(w): w € x}. (23.70) 
Remark 23.8. Let here w; € [0,1], 2 = 1,2,3. Then one can write 
u(R) = B(w) + (2Aw1we)(—91) + (2Awews3)(—72) + (2Awiw3)(—73), (23.71) 
where 
B(w) := (—Apar + Aut, wt + (—Ape22 + Anta) w3 
+ (—Apo3 + Auts)wW3 + maw + Wi2we + Misws 
+ 2Apr1fi2W1 We + 2Apiepi13Wew3 + 2Ap11Hi3W1w3. (23.72) 
By Theorem 23.4 again we find that 


—Yisup < —%V% < —Yiinf, t= 1, 2,3. (23.73) 


Consequently we get 
u(R) < Bw) + (2A\w1w2)(—V1 ine) + (2Aw2w3)(—72 ine) 
+ (2Aw1ws3)(—Y3 int) =: O(w), (23.74) 
also we obtain 
u(R) 2 Bw) + (2Aw1w2)(—V1 sup) + (2Awews)(—72 sup) 
+ (2Aw1wW3)(—3 sup) =: L(w). (23.75) 
Now we can maximize 6(w), L(w) over w € (0, 1]? as described earlier in this Section 


23.3.2. That is finding tight estimates for the crucial quantities (23.39) and (23.40). 
That is, we find 


max ( max WR) < max OA(w), (23.76) 
we [0,1]? \ (11,792,798) we[0,1]° 
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and 


max ( min WR) > max L(w), (23.77) 


w€[0,1]3 \ (41,772,783) ~ we[0,1]8 


Notice that in expressing u(R) we use all moments f4;, Wai, 7 = 1,2,3. 
23.3.3. Applications of y-Moment Problem to Variance 


Let -a<X<a,-8<Y<fber.v.’s,a,8>0. Set y:= EXY, py := EX, 
vy := EY, y, ti, 41 € R. The covariance is 


cov(X,Y) =y- wiv (23.78) 
and in our case it exists as it is easily observed by 
Var(X) < 2a”, Var(Y) < 26”. 
Then by Theorem 23.4 we have 


sup cov(X, Y) = supy — 1/1, (23.79) 
XY XY 

and 
inf cov(X, Y)= soe = p14. (23.80) 


We need to mention 
Corollary 23.9 (see [235], Kingman and Taylor, p. 366). It holds 
Var(X + Y) = Var(X) + Var(Y) + 2 cov(X,Y). (23.81) 


Thus we obtain 


sup Var(X + Y) < sup Var(X) + sup Var(Y) + 2supcov(X, Y) 
pa x¥ xy xX 
< sup Var(X) + sup Var(Y) + 2supcov(X,Y). (23.82) 
x Y X,Y 


And also we have 
inf Var(X + Y) > inf Var(X)+ inf Var(Y) + 2 inf cov(X, Y) 
X,Y XY X,Y X,Y 
> inf Var(X) + inf Var(Y) + 2 inf cov(X,Y). — (23.83) 
xX Y XY 
Above we have EX = p41, EY = 1, as prescribed moments. Notice that 
Var(X) = EX?— 9 and Var(Y) = EY? —v?. So we would like to find inf (Ew? dp 
mn 
such that 
/ x du = -i, a> 0, 


-—a 
where yz is a probability measure on [—a,a], etc. Applying the basic geometric 
moment theory one derives that 


sup Var(X) = a? — yi, 
xX 


sup Var(Y) = 6? — 2%, (23.84) 
Y 
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and 
inf Var(X) = 0, inf Var(Y) =0. (23.85) 
We have proved 


Proposition 23.10. Let the r.v.’s X,Y be such that -a< X <a, -B<Y <8, 
a,B>0 and EX = p,, EY = 1 are prescribed, where 1,1, € R. Set y:= EXY. 
Then 


sup Var(X + Y) < a? + 6? — (u1 +11)? + 2yeup, (23.86) 
X,Y 
and 
inf Var(X + Y) > 2(rint - [). (23.87) 


Here Ysup and Ying are given by Theorem 23.4. 


(A) Next we generalize (23.86) and (23.87). Namely let X1, X2,..., Xn be r.v.’s 
such that —a; < X; < a;, aj > 0, i =1,...,n. Thus E|X;| < a; and EX? < a?. 
We write uj; := cov(X;, X;) ee do exist. Let 0 < 0; < w; < uj, 2 = 1,...,n, 


usually w; € [0,1] be such that w; = 1. Here w,’s are fixed, they could be the 


percentages of allocations of funds ‘to bonds and stocks. 
Set 


Z=S wiki. (23.88) 


Then by Theorem 14.4, p. 366, [235], we obtain 
Var(Z aS s wWiwjVi;- (23.89) 
a1 p= 1 
That is, we have 


Var(Z =o vit 3 eer cov(X;, X;) 


t=1 j=l 


tAj 
= =u Var(X;) + +S eo cov(X;, Xj). (23.90) 
w=1 g=1 
WJ 
Set 
Mi s= EX, Vij cs EX;X;, a9 = 1, see Me (23.91) 


Proposition 23.11. We obtain 


Var(Z) < Yow a? — p2;) + 3 a ( sup Yj ~ ts) ; (23.92) 


i=1 j=l Mili, Pilg 
tAj 
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and 


M1 


Var(Z) > > So wiw; ( —_ ep paints) : (23.93) 


(B) Let again X),...,X, be r.v.’s such that —a; < X; < a, aj > 0,7 = 
1,...,n.Set viz := cov(X;,X;). Let fixed w; be such that 0; < w; < wi, Ci,ui € R, 
7=1,...,n 

Set 


Z=S wii. (23.94) 
t=1 
Then by Theorem 14.4, p. 366 of [235] we get 


Var(Z = y= ye Wi Wj Vij 
a eu 2 Var(X;) + >> Swen (Xj, X;). (23.95) 
w=1 t=1 g=1 
ifj 


Furthermore we obtain 


Var(Z -> w? Var(X;) + S- SS ww; cov(X;, X;) 


i,j € {1,...,n} 
t#j 
Wi W5 > 0 
+ ye Se Wi W; cov(X;, Xj). (23.96) 
i,j € {1,...,n} 
t#j 
wiwj <0 


Here we are given HX; = fi, EX; = p1j;, we set also yj; := HX;X;. By Theorem 
23.4 we have 


Vij int S Vig LS Vij sup- (23.97) 
Obviously we derive 
inf cov(X;,.X;) < cov(X;,X;) < sup cov(X;, X;). (23.98) 
HiisMag Mis ba5 


We get 
Proposition aoe It holds 


Var(Z) < yt (a? a 2 ,) + Se 2 Wi Wj ( sup Vij — suse) 
:m} 


i,j € f{1,. poner 
i £5 j 
Wi W5 > 0 
UIE VL» n 
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and 


Var(Z) > > » Wi; (int - wits) 
i,j € {1,...,n} 
t#j 


Wi W; > 0 


+ ‘S os Wi W5 ( sup Vij — suis) , (23.100) 
ie ncaa pee 
t#j 
Wiwj < 0 
Part IT 
Here we treat the utility function 
u(R) = Eg(R), (23.101) 


where g: K C R — R*, K is an interval, g is a concave function, e.g. g(x) = 
(n(2+ a), 2 >-—1, and g(x) = (1+2)",0<a<1,¢>-—l. Also again 


R:=wX + (1-w)Y, 0<w<l, (23.102) 


and X, Y, r.v.’s as in Section 23.3.1. 
That is, it is 


u(R) = Eg(wX + (1—-w)Y). (23.103) 
Here the objectives are as in Sections 23.3.1, 23.3.2, etc. 
Lemma 23.13. Let g: K CR—-R, K interval, be a concave function. Then 
G(a,y) := g(wx + (1—w)y), O0<w<1 fixed, (23.104) 
is a concave function in (x,y). 
Proof. Let 0 < A < 1, then we observe 
G(A(2@1, y1) + (1 — A) (x2, ye)) = G(Ar, + (1 — A)ra, Ay1 + (1 — A)y2) 
= g(w(Aari + (1 — A)x2) + (1 — w)(Ay + (1 — A)ye)) 
g(A(war + (1 — w)yr) + (1 — A) (ware + (1 — w)y2)) 
> Ag(wai + (1 — w)y1) + (1 — A)g(wae + (1 — w)y2)) 
= AG(a@1, y1) + (1 — A)G(a2, y2). 


I 


Vv 


So in our case G(x, y) is concave and nonnegative. We would like to calculate 
inf u(R), i.e. to find 
sup 


‘iif i; Forename cicay) (23.105) 
kK2 


sup 
U 
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such that the moments 


jf edule.y) = 10, [yeule.y) =r (23.106) 
Ke Ke 


are prescribed, j1,1 € R. Here yp is a probability measure on K?. 
We easily obtain via geometric moment theory that 


sup u(R) = g(wui + (l—w)), g > 9, (23.107) 
Hu 
and 
Pie (xp x) = Ra g(wya + (1 — w)). (23.108) 
Examples 


(i) Let g(x) = n(24+ x), x > —1, then 
sup u(R) = €n(2 + wu + (1 — w)). (23.109) 
m7 


Suppose here , > > —1. Then wai + (1 — w)m > -1,0 < w <1. We would 
like to calculate 


max (sup W)) = max (én(24+ wi + (1 — w)1). 
we€ [0,1] im we [0,1] 


We notice that 


max (wui1+(1—w)1) =i atw=1. 
w€ [0,1] 


Consequently, since én is a strictly increasing function, we obtain 
max | sup u(R) |] = én(2+ p1). (23.110) 
we[0,1] \ (x,Y) 
(ii) Let g(x) = (14+ 2)°, 0O<a<1,x2>-1. Then 
sup u(R) = (14+ wai t+ (1 — w))*. (23.111) 
Lb 


Suppose again py, > Vv, > —1, then wu, + (1—w)m > —-1,0<w <1. The function 
(1+ x)® is strictly increasing. Therefore, similarly as in (i), we get 


max | sup u(R) } = (1+ p1)°. (23.112) 
we [0,1] \ (x,Y) 
Next we give another important moment result. 


Theorem 23.14. Let a, > 1 and T := conv({(0, 8), (a,—1), (—a,—1)}), and the 
random pairs (X,Y) taking values on T, i.e. 


X=alp-y), Y=AGB+1)-1, Ap,~>0: A+ p+ pul. (23.113) 
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Let g: R — Rt be a concave function and 0 <w<1. We would like to find 


ae. u(R) = ae Eg(wX + (1-w)Y) (23.114) 
such that 
Ex=11, EY =n, (23.115) 
where (41,11) € [-1,1]? is given. Also to determine 
Bia ( int, wr) F (23.116) 


We prove that 


inf u(R) = {ate —a)-1) 


(X,Y) 
4 gute DY gw Diy, pg aon 
(29( — w)@) — g(w(a + 1) — 1) — gw — a) — 1)) 
4 B+) (vy + uf. 


One can apply ats to both sides of (23.117) to get (23.116). 
we€|0,1 


Proof. Let a,3 > 1 and the triangle T with vertices {(0, 3), (a,—1),(—a,—1)}. 
Thus (#,y) € T iff (x,y) = A(0,@) + p(a,—1) + y(—a,—1), where A, p,y > 0: 
A+p+y¢=1, iff = pa— ya, y= AGB—p—y. That is, (x,y) € T iff x = a(p— y) 
and y = \(G+1)-—1. For example, if a = 2 then @ = 3 by similar triangles. Clearly, 
[fia ea 

Let g: R — R? be a concave function, then G(x,y) := g(wx + (1 — w)y), 
0<w <1 is concave in (x,y) and nonnegative, the same is true for G over T. We 
would like to find 


inf u(R) = int [ g(wx + (1 — w)y) du(a, y) (23.118) 
Le LOST 
under 
i rdp(x,y) = Ha, ‘A ydu(z,y) =, (23.119) 
T ir 


where (1,71) € [—1, 1]? is given, and yp is a probability measure on T. Call V the 
convex hull of Gl. The lower part of V will be the triangle 
T* = conv({(—a, -1, G(—a, -1)), (a, -1, G(a, -1)), (0, 8, G(0, B)) }) 
= conv({A, B,C}), (23.120) 


where 
A:= (-a, —l1,g(wil i a) 2 1)), 
B := (a,-1,g(w(a+1) - 1)), 
C := (0, 8, 9((1 — w)8)). 
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We can find that the plane (ABC) has an equation 


g(w(a + 1) — 1) — gw — a) — 1) 
( 7a ) era) 


y (22 wye) olla £1) slo) =D) og, 


z=g(w(il—a)—-1)+ 


2(6 +1) 
(23.121) 
Applying the basic geometric moment theory we get 
ing o(R) = {oop + Wer ND ge Dh 9 


+ SHOW oole DY ote, 4} 


2(8+ 1) 


Then one can apply re to both sides of (23.122). 
we€|0,1 


Application 23.16 to (23.117). Here 0 < w <1. Take a = 2 then we get 6 = 3. 
We consider g(x) = €n(3+ 2) > 0 for x > —2. Notice g(—w — 1) = én(2 — w) > 0, 
and g is a strictly concave function. 

Applying (23.117) we obtain 


(én(2 + 3w) — én(2 — w)) 


1 (p41 + 2) 


max ( inf We) = max {ine —w)+ 
we€[0,1] \ (X,Y) we [0,1] 
(20n3 + én(2 — w) — €n(2 + 38w)) 


7 8 


(1. + o}. (23.123) 


Let py = v; = 0, then 


oF ( et, WR) = Base {2 + én(2 —w) + én C= 1/2 
1/8 
+ én (=) ( = max {= in(r(u)} (23.124) 
where 
(w) := (2—w)*/8(24+3w)8? >0, O<wK<l. (23.125) 
We set 


t'(w) = (2 — w)~9/8(2 + 8w) 5/8 (1 — 3w) = 0. 


Here w = 1/3 is the only critical number for r(w). 
If w < § then 7/(w) > 0. 
If w > § then 7/(w) <0. 
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Thus 7(w), 0 < w < 1 has a local maximum at w = 1/3. Notice 7(0) = 2, 
7(1) = (125)1/8 = 1.8285791 and 


i 
T (5) = (347.22222)'/ = 2.077667. 


Thus the global maximum over [0,1] of the continuous function r(w) is 2.077667. 
That is, tT(w) < 2.077667, for allO0 < w < 1. Since én is a strictly increasing 
function we obtain that 


ént(w) < &n(2.077667) = 0.7312456, 


for all 0 < w < 1, with equality at w = 1/3. Consequently we have 


max ( inf WR) = 1.0058987, where wi =v, = 0. (23.126) 
we [0,1] \(X,Y) 


Application 23.16 to (23.117). Here again0 <w<1,a=2,8=3, 41 =™ =0. 
We consider g(x) := (24+ 2)°,0<a<1, a> —-2. Notice g is strictly concave and 
nonnegative. Also g(—w — 1) = (1—w)* > 0. Thus 


max ( fw”) = ms {a aay (G28 wr) 


we[0,1] \ (X,Y) we€[0,1 2 
(2 3w)* — (1+ 3w)* — (1 w)| 
+ ee 
8 
Eas Ca) (23.127) 
= —- Max Ww : 
8 welo] © : 


where 
x(w) := 2(5 — 3w)* +3114 3w)* +30 -—w)*, O< wl, 0<a<1, (23.128) 


is strictly positive and continuous. 
We have 


x'(w) = —6a(5 — 3w)*"* + 9a(1 + 3w)* + — 3a(1 — w)??, (23.129) 
which does not exist at w = 1, that is a critical point for y(w). Also we obtain 
x" (w) = 18a(a — 1)(5 — 3w)*? + 27a(a — 1)(1+ 3w)*? 
+ 3a(e@ —1)(1 —w)*-? <0, O0<w<l. (23.130) 


That is x’(w) is strictly decreasing on [0,1), ie. x’(1—-) < x’(0). Also y(w) is 
strictly concave on [0,1]. We rewrite and observe that 
i l-a _ 1 l-a fi as l-a _ 1 3 l-a 
vw) = a [6 [Sa 3we = + 3u) (L= we (143u)*7) 
(1+ 3w)!-°(5 — 3w)!-@ (1 — w)!-¢(1 + 3w)!-¢ 
(23.131) 
if 2 <w <1. That is, y(w) is strictly decreasing over [s. 1) Notice that 


x’ (0) = 6a(1 — 5°) > 0. (23.132) 
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That is, x at least close to zero is strictly increasing. Clearly there exists unique 
wo € (0,1) such that y/(wo) = 0, the only critical number of x there. 
We find (0) = 2(5° + 3), 
x(1) = 2-2%43-4%. (23.133) 
Consequently we found 
1 
inf == 2(5° 2° (2 - 2° ; 23.134 
Ene ( int, wr) 5 max (5% + 3),2°(2 +3. 2%), x(wo) } (23.134) 
Next we present the most natural moment result here, and we give again appli- 
cations. 


Theorem 23.17. Let 3 > 0, and the triangles 


Ti = conv({(G, Bs (8, —B), (—8, a )}) 
Tz = conv({(G, 3), (—8, —8); (—G, 8)}). 


Let the random variables X, Y taking values in [—G,@]. Let g: [-8,8] — R* 
concave function andO<w<1. We suppose that 


p 
p 


E 


g(B(1 — 2w)) + g(B(2w — 1)) = (9(-#) + 9(P)), (23.135) 
for dlO0<w<1. We would like to find 
inf Si - . 
ae we Eg(wX + (1-w)Y) (23.136) 
such that 
BX=i, EV =v; (23.137) 
are prescribed moments, 41,1 € [—{, 3]. Also to determine 
max ( inf WR). (23.138) 
w€ [0,1] \(X,Y) 


We establish that: 


Tf (41,41) € Ty (iff nn = B(A+p—-¢), 11 = BOA-p—¢), A, 2,9 = 0: At+p+y = 1) 
then it holds 


inf u(R) = {203 + CeCe i - 8) 
, (g(8)- aise =) ae a} (23.139) 


Tf (1,11) € Te (iff ws = BIA-—p-¢), 1 = BA-pt¢), A,2,9 20: A+ p+y=1) 
then it holds 


(9(8) — g(B0. — 2w))) 


inf u(R) = {at + (141 — 8) 


(X,Y) 
4 (GWG = 20) = 9-8) a (23.140) 
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One can apply at to both sides of (23.139) and (23.140) to get (23.138). 
we|0, 


Proof. Let g: [—, 8] ~ Rt be a concave function, 3 > 0. Then G(z, y) := g(wa+ 
(1—w)y) > 0, 2, y € [-8, 8], 0 < w <1, is concave in (x,y) over [—, B]?. We call 
R := [-, GB]? the square with vertices (G, 3), (—8, 8), (-8,—8), (8,8). Clearly 
(0,0) € R. On the surface z = G(x, y) consider the set of points 6 := {A, B,C, D}, 
where 


A:= (8,8, 9(B)), Bi= (B, —B, g(B(2w = Ly), 
C = (—8,—-8,9(-8)), D = (—8, 8, g9(8(. — 2w))). (23.141) 


These are the corner-extreme points of this surface over R. 

Put V := conv @. The lower part of V is the same as the lower part of V*—the 
convex hull of G(R). We call this lower part W, which is a convex surface made 
out of two triangles. We will describe them. 

We need first describe the line segments ; := AC, ly := BD. We have for 
£, := AC that 


x= B-—2(Ct 
y= B— 26t 
z= 9(8)+(9(-B)-9(8))t, O<t<1. (23.142) 
And for €) := BD we have 
x=—B+ 26t 
y = B— 26 


z= g(A(1 — 2w)) + (g(B(2w — 1) — g(BU— 2w)))t, OS t < 1.223.148) 


When t = 1/2, then « = y = 0 for both @,, @2 line segments. Furthermore their z 
coordinates are 


5 , and 
2w—1 1-2 
ay ee ee) Dear yy (23.144) 
Since G is concave, 0 < w < 1, it is natural to suppose that 
2(lo) > 2(£1). (23.145) 


Thus the line segment BD is above the line segment AC. 
Therefore the lower part W of V* consists of the union of the triangles 


A A 
P:=(ABC) and Q:=(ACD). 


A 
The equation of the plane through P = (A B C) is 


(G(2w — 1)) = g(-#)) G (9(2) = g(B(2w — 1))) 


z= 9(3)+4 oe —p)+ 2B (y-), (23.146) 
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A 
and the equation of the plane through Q = (A C D) is 


26 
(23.147) 


We also call 
Ty := proj,,(A BC) = convi({(8, 8), (8,8), (-8, -8)}), 


Ty *= prOjgy(A C D) = conv({(B, A), (—B,—B),(—B, B)}). (23.148) 
We see that 


(a,y) eT, iff JAj,p,~p>0:A+p+yp=1 
c= BA -_ 2), y= B(2A _ 1). (23.149) 


And we have 


(z,y) € To iff FA,p,p>0:A+p+yp=1 
with (x,y) = A(G, 8) + p(—B, —P) + o(—B, P), iff 


«= B(2\—-1), y= B(1— 29). (23.150) 
We would like to calculate 
L:= int [ g(wa + (1 — w)y) du(a, y) (23.151) 
M J[-B,6)? 
such that 
if xdu(x,y) = p1, / ydu(z,y) =, (23.152) 
[6,8]? [-6,8]? 
where (ji1,¥1) € Ty iff w1 = B(A+ p— ¥), 
1. = BA-p—-), Ap,~¢20:A+p+y=1, (23.153) 


or (1,11) € To iff wy = BIA- p—y), m1 = B(A-— pty), are prescribed first 
moments. Here py is a probability measure on [—(, 3]?, 3 > 0. 
The basic assumption needed here is 


o(B(L — 2w)) + 9(B(2w — 1)) > (o(—8) + 9(8)), for allO<w<1. (23.154) 
Using basic geometric moment theory we obtain: If (411,71) € T; we obtation 


(9(G(2w — 1) = g(-2))) (9(8) — g(B2w — 1))) 


L=g(8)+ 3B (pi =2)+5 2B QA 9): 
(23.155) 
When (11,11) € T2 we get 
L = g(B) + {g(6) — o(80 = 20) (, = Bab OOS aks) (v1 — 8). 


26 
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At the end one can apply Bas to both sides of (23.155) and (23.156) to find 
(23.138). 
Application 23.18 to (23.139) and (23.140). Let 

g(x) :=(1l+2)°%, 0<a<l, «>-1, 0<w<l, 


which is a nonnegative concave function. Take G = 1 and see g(1) = 2%, g(—1) = 0. 
If (1,71) € Ti, then from (23.139) we obtain 


a u(R) = 2% + 29-1 w% (a) — 1) + 29-11 — w®)( — 1). (23.157) 
If (441,71) € T> we get from (23.140) that 
int wl) = 2% 42°11 — (1 — w))(a — 1) + 2°11 — w)*% (4, — 1). (23.158) 


Notice that assumption (23.135) is fulfilled: 
It is 


(1-+ (1—2w))*+(1+ (Q2w—1))* = 2%((1—w)* + w) > 2%, 0<w <1, (23.159) 


true by 
(l—-w)* +w% > ((1—w)+w)* =1. 
Next take 1; = 1 =0, ie. (0,0) € T, NT2. Then 


inf u(R) = 2% —2°-1 23.160 
a ) ( ) 
and 
max ( inf u(R)) ey cae (23.161) 
w€[0,1] \(X,Y) 


Application 23.19 to (23.139) and (23.140). Let 
g(x) := n(2+2), r>-2, 0O<w<l 


which is an increasing concave function. Take 3 = $. Clearly g(x) > 0 for —4 < 
x <4. If (41,1) € Ti we obtain via (23.139) that 


inf (u(R)) = n(2.5) + (€n(w + 1.5) — én(1.5)) (1 5 5) 


(X,Y) 


1 
+ (¢n(2.5) — én(w + 1.5)) (. - 5) : (23.162) 
If (441,11) € T> we obtain via (23.140) that 


BBE (w(R)) = lr(2.5) + (En(2.5) ~ en(2.5 ~ w)) (1 - 5) 


+ (én(2.5 — w) — en(1.5)) (n- ); (23.163) 


ot 


We need to establish that g fulfills (23.135). 
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That is, that it holds 
én(2.5 — w) + €n(1.5 + w) > n(1.5) + €n(2.5), (23.164) 


as true. 
Equivalently we need to have true that 


én(3.75 + w(1 — w)) > &n(3.75), OS wl. (23.165) 


The last inequality (23.165) is valid since én is a strictly increasing function. So 
assumption (23.135) is fulfilled. 
Next take 4; = 1 =0, ie. (0,0) € T, NT. Then we find 


inf u(R) = 0.6608779, (23.166) 
(X,Y) 
and 
max ( inf WR) = 0.6608779. (23.167) 
we [0,1] \(X,Y) 


23.3.4. The Multivariate Case 
Let X; be r.v.’s, w; > 0 be such that Sw = land Rs = 2 wiXi. Let g bea 
concave function from an interval K CR ae Rt, and 7 
u(R) := Eg(R). 
We need 


Lemma 23.20. Let g: K — R be a concave function where K is an interval of R, 
and let 


G(x, a Fy) =J9 (>: vn) : 
i=1 
Then G(a1,...,%n) 18 concave over K”. 
Proof. Indeed we obtain (0 < A < 1) 
= Gir, + (1—-A)aet,..., Av, + (1 — A)ai)) 


= gq (>: wi(Agi, + (1 — x) 


II 
Ka) 
nN 
»~ 
ao 
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oe 
ae 
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We would like to determine 


sup u(R)= sup Eg (>: vx) =sup | g (doe) dp(ti,...,2n); 
1 n) i=l Kn Vel 


(X1,...,Xn) (X1,...,X im 
(23.168) 


such that 
EX;,= vj du(a1,..-,%r)= mi EK, i=1,...,n, (23.169) 
kn 
are given first moments. 
Here yz is a probability measure on K”. Clearly the basic geometric moment 
theory says that 


sup u(R)=g bs vi) ; (23.170) 


where g > 0 is concave. 
We further would like to find 


_, max ' ( sup un) = max g (>: vm) ; (23.171) 
= Wn Xn) w ; 


where w; > 0 and S> w; = 1. 
i=l 


Examples 23.21. 
i) Let g(x) := €n(2+ <2), « > —1, then 


sup u(R) = én (2 + 3 oi) 5 (23.172) 


i=l 


n 
where fy > fig > ++: > fn > —1, that is giving > wi, > —1. Notice that 
i=1 


nm 
SS wip < fa. Hence 
i=l 


max (>: vs) =p, when wy =1, wj=0, 7=2,...,n. (23.173) 
MHI 
Since én is strictly increasing we obtain 
max sup u(R) ) = én(2+ p11). (23.174) 
(w1,--,Wn) (X1 geoe5 Xn) 


ii) Let g(a) :-= (14+ 2)°, > -1,0<a<1. Then 


sup u(R)= (: + Sv) , where py > pg >++: > pn > 1, (23.175) 
) 3 
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giving )> wip; > —1, where w; > 0 such that S> w; = 1. 
ra 


= i=l 
Thus 
max sup u(R)} =max| 1+ S- witli) . (23.176) 
BN 5 ack SO) w = 
Since (1 + x)® is strictly increasing we get that 
max sup u(R)}| =(1+ p1)%, (23.177) 
(w1,--.,Wn) (X41 ‘endl Xn) 


at w, = 1, wi = 0,7 =2,...,0. 


Chapter 24 


Discrepancies Between General Integral 
Means 


In this chapter are presented sharp estimates for the difference of general integral 
means with respect to even different finite measures. This is accomplished by the 
use of Ostrowski and Fink inequalities and the Geometric Moment Theory Method. 
The produced inequalities are with respect to supnorm of a derivative of the involved 
function. This treatment follows [48]. 


24.1 Introduction 


This chapter is motivated by the works of J. Duoandikoetxea [186] and P. Cerone 
[123]. We use Ostrowski’s ([277]) and Fink’s ({195]) inequalities along with the 
Geometric Moment Theory Method, see [229], [20], [28], to prove our results. 

We compare general averages of functions with respect to various finite measures 
over different subintervals of a domain, even disjoint. The estimates are sharp and 
the inequalities are attained. They are with respect to the supnorm of a derivative 
of the involved function f. 


24.2 Results 


Part A 


As motivation we mention 


Proposition 24.1. Let 11, pe finite Borel measures on [a,b] C R, [c,d], [é,g] C 
[a,b], f € C*({a, b]). Denote pr({c, d]) = mi > 0, pe([é,g] = m2 > 0. Then 


i Vet 1 £9 

— [feds -— [flo sdys 
My ee msg = 

Proof. From mean value theorem we have 


f(x) — FY) < MF lloo(b- a) = 7, Ve,y € [a, 0). 


< || f'lloo(b— a). (24.1) 
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That is, 
-7Sf()-fy) <7, Ve,y € [a, 4, 
and by fixing y we obtain 
ik d 
=s — | f(x)din — fly) <7. 
My é 
The last is true Vy € [é, g]. Thus 
i. oe if? 
-ys ft pea)din —  f fle)dua <9, 
™1 Je m2 Jé 


proving the claim. 


As a related result we have 


Corollary 24.2. Let f € C'({a,]), [c,d], [é,g] C [a,b] CR. Then we have 


1 7? 1 9 
Fea fade — | f(a)dx 


We use the following famous Ostrowski inequality, see [277], [21]. 


<|[f'lloo-(b-@). (24.2) 


Theorem 24.3. Let f € C'({a,b]), x € [a,b]. Then 


yee) = pean < fle 


and inequality (24.3) is sharp, see [21]. 


We also have 


Corollary 24.4. Let f € C'({a,0]), x € [c,d] C [a,b] CR. Then 


Lf s@ f(x ae| < NP Fs Max{ ((c~a)?+ (c=?) (da)? + (d-0)} 


rc 2) = 26 
(24.4) 


hay 


Proof. Obvious. 


We denote by P([a, b]) the power set of [a,b]. We give the following 
Theorem 24.5. Let f € C1({a,b]), be a finite measure on ({c, d], P({c, d])), where 
[c, d] C [a,b] CR and m := p([c,d]) > 0. Then 

1) 


1 1 ? 
2 ed flaydu— —— | f(ax)dx 


IIF’Tloo 
~ 2(b-a) 


2) Inequality (24.5) is attained when d = b. 


Max{((c =a)? + (e—6)*), ((d— a)? + (d— b)*) }.(24.5) 
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Proof. 1) By (24.4) integrating against p/m. 
2) Here (24.5) reduces to 


= | pooydn— 2 f pea < I Filleo (, — 4g), (24.6) 
Mm Sie,b] b-aJa a 2 
We prove that (24.6) is attained. Take 
fra) =F) aces, 
Then f*/(x) = 5% and || f*’||0 = 744, along with 
b 
| f*(a)dx = 0. 

Therefore (24.6) becomes 

1 

— f*(x)du| < 1. (24.7) 

mM JTe,b] 


Finally pick 4 = 64} the Dirac measure supported at {b}, then (24.7) turns to 
equality. 

We further have 
Corollary 24.6. Let f € C'({a,b]) and [c,d] C [a,b] CR. Let M(c,d) := {u: wa 
measure on ({c,d], P([c,d])) of finite positive mass}, denoted m := ([c, d]). Then 


1) It holds 
1 tf? 
ae f(a)dp — i f(x)dx 


sup 
ue M (c,d) 
< we Max{((c — a)? + (e— b)’), ((d— a)? + (d—)?)} (24.8) 
IF lhoo (d—a)*+(d—b)’, ifdteza 1 
~ 3(6—a) * cee oe ae ifdtc<atb 
< I Tes (5 ay. (24.9) 
Inequality (24.9) becomes equality if d = b or c= a or both. 
2) It holds 
eae ES ant f IP lloe a 
sup (2°, + ie Fladu— 5a f fade < Hh (ba). (24.10) 


ax<c<d<b 


Next we restrict ourselves to a subclass of M(c,d) of finite measures y with 
given first moment and by the use of the Geometric Moment Theory Method, see 
[229], [20], [28], we produce a sharper than (24.8) inequality. For that we need 


Lemma 24.7. Let v be a probability measure on ([a,b], P([a, b])) such that 
/ adv = d, € [a,}| (24.11) 
[a,b] 


is given. Then 
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U1 := sup [,@ — a)’dv = (b—a)(d — a), (24.12) 


v aS iM (24.11) 


Ug := sup / (a — b)?dv = (b—a)(b— dy). (24.13) 
v as im (24.11) ¥ [4,2] 


Proof. i) We observe the graph 
Gy = {(2,(a@—a)?):a<a< bd}, 

which is a convex arc above the z-axis. We form the closed convex hull of G; and 
we name it G, which has as an upper concave envelope the line segment ¢; from 
(a, 0) to (b, (b—a)?). We consider the vertical line x = d; which cuts ; at the point 
Qi. Then U; is the distance from (d;,0) to Q1. By using the equal ratios property 
of related here similar triangles we find 

dy —a Ui 

b—a (b— a)?’ 


which proves the claim. 

ii) We observe the graph 

G2 = {(2,(@—b)*):a< a < Dd}, 

which is a convex arc above the x-axis. We form the closed convex hull of Gz and 
we name it Gs which has as an upper concave envelope the line segment ¢2 from 
(b,0) to (a, (b—a)?). We consider the vertical line 2 = d; which intersects ¢2 at the 
point Qo. 

Then U2 is the distance from (d1,0) to Q2. By using the equal ratios property 
of the related similar triangles we derive 

U2 b— dy 
(b— a)? b-a’ 


which proves the claim. 


Furthermore we need 


Lemma 24.8. Let [c,d] C [a,b] C R and let v be a probability measure on ({c, d], 
P([c, d])) such that 


| adv = d, € [c,d (24.14) 
[c,d] 


is given. Then 


(i) 


Uy := sup i (2 — a)°dv = di(c + d— 2a) — cd+a?, (24.15) 
v as in (24.14) / [c,d] 
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(ii) 


ess, eas / (x — b)?dv =dy(e+d—2b)—cd +B. (24.16) 
v as im (24.14) /[e,d] 


(iii) It holds 


sup i [(x — a)? + (w — b)?] dv = Uy + Up. (24.17) 
v as tn (24.14) [c,d] 


Proof. (i) We notice that 


d d 
i (2 — a)’dv = (c— a)? + 2(c— a)(dy — c) 4 / (a — c)?dv. 
Using (24.12) which is applied on [c, d], we find 


d 
sup i. (x — a)?dv = (c— a)? + 2(c — a) (dy — cc) 
v as in (24.14) Ve 


d 
u x —c)*dv 
ae fa oer) 
= (¢€-— a)? + 2(c— a)(d) — 0) + (d— e)(d; —c) 
= d(e+ d— 2a) — cd+a’, 


proving the claim. 
(ii) We notice that 


d d 
: (2 — b)?dv = (b— d)* + 2(b— d)(d— dy) 4 | (2 — d)?dv. 
Using (24.13) which is applied on [c, d], we get 


d 
sup / (x — b)?dv = (b— d)? + 2(b— d)(d— d)) 
v as in (24.14) Ve 


d 

+ u x — d)*dv 
v as i Pant: 

= (b— d)* + 2(b— d)(d— d,) + (d—)(d— dy) 

= di(c + d—2b)—cd+ 0", 


proving the claim. 
(iii) Similar to Lemma 24.7 and above and clear, notice that (2 — a)? + (a — b)? 
is convex, etc. 


Now we are ready to present 


Theorem 24.9. Let [c,d] C [a,b] CR, f € C'({a, b]), w a finite measure on ([c, dl, 
P((c, d])) of mass m := p([c, d]) > 0. Suppose that 


1 d 
= | adu=d,, c<d,<d, (24.18) 
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18 given. 
Then 


sup 


1 d 1 b 
m t)dp — —— x)dx 
p as above ah fia)de | 1 


e lflle 


~ (b-a) 


(24.19) 


lai((c+ a) ~(a +0) oe — 


Proof. Denote 


Bla) = PERE (a — a)? +(e -5)%), 


then by Theorem 24.3 we have 


1 
b— 


b 
Bla) < fle) ~~ f fade < Bla), Ye € fed, 


Thus 
d d b d 
an / Ala)dp < — / f(a)dp— / f(a)de < — y Ala)dp, 


and 


1 d 
co f (edn 0. 


2 [seh [reo 


Here v := 4 is a probability measure subject to (24.18) on ((e,d], P([e, d])) and 


Ne i gO Pa 
0 Hie ( fc pes fe b) us) 
IIF hoo ‘ 2 : 2 
- Hs. (| (x — a) w+ f (e-2) ww) 


24.15), (24.16) and (24.17) we derive 


Using (24.14), ( 
IPMIES | L gq?) 4 | L 2 
0< 2b — ay nlc + d — 2a) — ed + a”) + (di(e + d — 2b) — ed + b*)} 


SUM aca oes ce 
= files Ya +d) —(a+0))—ed+ 5 i; 


proving the claim. 
We make 


Remark 24.10. On Theorem 24.9. 
1) Case of c+ d>a+b, using d, < d we get 


a? + b? s (d—a)? + (d—b)? 
—— 


di ((c+ d) —(a+b)) —cd+ 5 


(24.20) 
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2) Case of c+ d<a+b, using d; > c we find that 


2 b2 = 2 —b 2 
dy((c + d) — (a+b) — ed + Pee laa 2 (24.21) 
Hence under (24.18) inequality (24.19) is sharper than (24.8). 
We also give 
Corollary 24.11. All assumptions as in Theorem 24.9. Then 
~ f te\au-— | flea 
IIF Theo a* +b 
+b 24.22 
= (b—a) i((e d) —(a )) d+ 2 ( ) 
By Remark 24.10 inequality (24.22) is sharper than (24.5). 


Part B 
Here we follow Fink’s work [195]. We need 


Theorem 24.12 ({195]). Let f: [a,b] = R, f("-» is absolutely continuous on [a,b], 
n>1. Then 


A b ates aoe (k-1) a — b\k — fl (a)(x — a)* 
f(z) = | feyar+ So( =) (Ae “ = wees 
e k=1 : 


—— 
(n—1l(b—a) 


/ ic — t)"-1 k(t, x) ff (£)dt, (24.23) 
where 


k(t, a) := 


_ <t<a< 
‘s a, ax<t<a<b, (24.24) 


t-b, a<a<t<b. 
For n =1 the sum in (24.23) is taken as zero. 


We also need Fink’s inequality 


Theorem 24.13 ({195]). Let f("-) be absolutely continuous on [a,b] and f™ € 
Lo(a,b), n> 1. Then 


n-1 db 
(10+ Sn) ~o | teas 
k=1 is 


[(b—a)"t1+(a@—a)"*"], Vee [a,b], (24.25) 


where 


F,(x) := (5 *) (Peete a er) (24.26) 


Inequality (24.25) is sharp, in the sense that is attained by an optimal f for any 
x € [a,b]. 
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We get 


Corollary 24.14. Let f— be absolutely continuous on [a,b] and f™ € L.(a,b), 
n> 1. Then Va € [c,d] C [a,b] we have 


n-1 b 
- (1 + > rt) ~ i | tea 
k=1 w 


Lf Ihoo [(b _ oa 4 (x -_ a)"*"] 
~ n(n +1)!(b- a) 
< aE —a)” (24.27) 
~ n(n+1)! : , 


Also we have 
Proposition 24.15. Let f("-!) be absolutely continuous on [a,b] and f(™ € 
Loo(a,b), n> 1. Let be a finite measure of mass m > 0 on 


({c, J, P([e,d])), [c,d] € [a,b] CR. 


it Ae cae | 1 f° 
(2 [fours  Fitew) - [ sae 


b—a 
IF hoo i a ee 
n(n + 1)!(6— a) E [0 sca af 


[e,d] 
IF hoo n 
N= ays 
~ n(n+ pi @) 


Proof. By (24.27). 


Then 


Ks: 


(x —- oy 


(24.28) 


Similarly, based on Theorem A of [195] we also conclude 
Proposition 24.16. Let f("~) be absolutely continuous on [a,b] and f(™ € 
L,(a,b), where 1 <p <ow,n> 1. Let p be a finite measure of mass m > 0 
on (|c,d], P([e, d])), le, d] C [a,0] CR. 

—— 


Here p' > 1 such that Es a = 1. Then 


rae at ee i 
—|— ax)d —_— Fy. (x)d = x)d 
(2 eg OU he sorte) saa | teva 


B((n— Up! +1,p! +1)” Fly 
ni(b— a) 


[of waa ts = a) MP dy 
™M J[c,d) 
/p' n—1+4 
B (n—1)p'+1,p' +1) * (b—a) 3 n 
=) IF lp. (24.29) 
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We make the following 
Remark 24.17. Clearly we have the following for 


g(x) = (b—x)"*1 4 (x —a)"t! < (b-a)""', ax<as<bd, (24.30) 
where n > 1. Here 7 = ath is the only critical number of g and 
a+b (b—a)”1 
1)-——._-_ > 0 
"(S) snr nl > 0, 


giving that g(4¢ by = &a)""" 5 0 is the global minimum of g over [a,b]. Also g is 
convex over [a,b]. Therefore for [c, d] C [a,b] we have 


M:= Boe {(x —a)"t! + (b—a)"t"} 


= max{(c—a)"** + (b—c)""1, (d—a)"** 4+ (b-d)"*}. (24.31) 
We find further that 


_ (d—a)"*14 (b-—d)"*1, ifc+d>at+b (24.32) 
(c—a)"™*1+(b-c)"*1!, ife+d<atb. 
If d= 6b or c=a or both then 
M = (b-a)"*1, (24.33) 


Based on Remark 24.17 we present 


Theorem 24.18. All assumptions, terms and notations as in Proposition 24.105. 
Then 


1) 
IF |Joo nt+l1 n+l 
_— n(n + Db —a) max{ (c a) + (b c) 3 
(d—a)"** + (b-d)"*"} (24.34) 
WF Lo (d—a)"*'+(b-d)"*!, ife+d>atb, 
= x 
n(n + 1)'(b — a) (c—a)"*14 (b—c)"*!, ifetd<atb 
AF lloo. 
< _ : 
Sime” (24.35) 
where K is as in (24.28). If d=b or c=a or both, then (24.35) becomes equality. 
When d = b, & = 6,4; and f(x) = ka =a)" ,a<a<b, then inequality (24.34) is 


attained, i.e. it becomes equality, proving that (24.34) is a sharp inequality. 
2) We also have 


sup K < RHS.(24.34), (24.36) 
we M (c,d) 
and 
sup ( sup «) < R.HS.(24.35). (24.37) 
all c,d we M (c,d) 


a<c<d< b 
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Proof. It remains to prove only the sharpness, via attainability of (24.34) when 
d= b. In that case (24.34) reduces to 


(2 [fe n+ 2 why wae) — A, [store 


1s (b—a)”. (24.38) 

The optimal measure here will be 4 = by and then (24.38) becomes 

, (r+ nw) 3 <lE lee ay. (24.39) 

ae ne | 

The optimal function here will be 
fq" a<a<b 
Then we observe that 
frED (a) = a eae k—-1=0,1,...,n—2, 


(n—k+1)!’ 


and f**-) (a) = 0 for k-1 = 0,1,...,n—2. Clearly here F,(b) = 0,k =1,...,n—1. 
Also we have 


—a)” 
= ———— and || f*™ |, = 
raf rear= SEO ona 17° 
Putting all these elements in (24.39) we have 


teas oe) te 
nn! (n+ 1)! n(n +1)!’ 


proving the claim. 


Again next we restrict ourselves to the subclass of M(c, d) of finite measures 
with given first moment and by the use of Geometric Moment Theory Method, see 
[229], [20], [28], we produce a sharper than (24.36) inequality. For that we need 


Lemma 24.19. Let [c,d] C [a,b] C R and v be a probability measure on ({c, d], 
P([c, d])) such that 


| adv = d, € [c,d (24.40) 
[c,d] 
is given, n>1. Then 


Wi := sup i) (¢— a)" dp 
[c,d] 


v QS in (24.40) 


= (s- —a)"*(e- 0) (d; —d)+ (d—a)"™. (24.41) 


k=0 
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Proof. We observe the graph 
GuH{@e— a nes a <a}, 


which is a convex arc above the x-axis. We form the closed convex hull of G; and 
we name it en which has as an upper concave envelope the line segment @; from 
(c, (c—a)"*1) to (d,(d—a)"*"). Call @ the line through @,. The line ¢; intersects 
x-axis at (t,0), where a <t <c. We need to determine t: The slope of ¢; is 


oe (gra =S(d-a)""*(e—a)*. 


k=0 
The equation of line ¢; is 


y=m-a2t+(d—a)"t' — md. 
Hence mt + (d—a)"*! — md = 0 and 


os n+1 
page arr 


m 
Next we consider the moment right triangle with vertices (¢,0), (d,0) and (d, (d— 
a)"*+), Clearly (di, 0) is between (t,0) and (d,0). Consider the vertical line x = dj, 
it intersects 2; at Q. Clearly then W, = length((d1,0), Q), the line segment of which 
length we find by the formed two similar right triangles with vertices {(t,0), (d1,0), 
Q} and {(t,0), (d,0), (d,(d —a)"*1)}. We have the equal ratios 
dy t WwW, 


d—t  (d—a)r*!’ 


d,—t 
= Le TET ft a7 
W, = (d a) (4). 


We also need 


Lemma 24.20. Let [c,d] C [a,b] C R and v be a probability measure on ([c, d], 
P([c,d])) such that 


/ xdv = d; € [c,d] (24.42) 
[c,d] 
is given, n>1. Then 

1) 

Wo := sup / (b—a)"* dp 
v as im (24.42) / [c,d] 
~ (s20 OS he 0) (c—d,)+(b-—e)"*1. (24.43) 
k=0 
2) It holds 


ap / [(a — a)"** + (b— 2)"*"] dv = Wi + Wa, (24.44) 
v As in (24.42) ¥ [c,d] 
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where W, as in (24.41). 
Proof. 1) We observe the graph 
G2 = {(2,(b—2)"*"): e< a < dh, 


which is a convex arc above the z-axis. We form the closed convex hull of Gz and 
we name it Cx which has as an upper concave envelope the line segment @) from 
(c, (b— c)"*) to (d,(b— d)"*"). Call fy the line through 0. The line ¢g intersects 
x-axis at (t*,0), where d < t* < b. We need to determine t*: The slope of é2 is 


je = PABA D (5G ge ta) 


k=0 
The equation of line £2 is 


Hence 


and 
ti=c- 


Next we consider the moment right triangle with vertices (c, (b—c)"*"), (c,0), 
(t*,0). Clearly (d1,0) is between (c,0) and (t*,0). Consider the vertical line = d,, 
it intersects 0) at Q*. Clearly then 


W,2 = length((d,,0), Q*), 
the line segment of which length we find by the formed two similar right triangles 
with vertices {Q*, (di, 0), (t*,0)} and {(c,(b—c)"*+), (c,0), (t*,0)}. We have the 
equal ratios 
t* — dy W2 
t*—c (b—c)rtl? 


2) Similar to above and clear. 


We make the useful 


Remark 24.21. By Lemmas 24.19, 24.20 we get 


n 


A:=W,+ We. = (32 Sa Wes 0) (d, — d) 


k=0 


+ (s20 —¢)** (b= 0) (e— di) +(d—a)"*++(b—c)"t1 >0, (24.45) 


k=0 
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n> 1. 
We present the important 


Theorem 24.22. Let f~ be absolutely continuous on [a,b] and f\”) € Lx(a,b), 
n> 1. Let p be a finite measure of mass m > 0 on ({c, d], P([c, d])), [c,d] € [a, b] C 
R. Furthermore we suppose that 


1 
— xd = dy € [c,d] (24.46) 
™ Jfe,d] 
is given. Then 
sup K< ile llsSse 3 ; (24.47) 
p as above n(n + 1)'(b— a) 
and 
K <RHS.(24.47), (24.48) 


where K is as in (24.28) and X as in (24.45). 
Proof. By Proposition 24.15 and Lemmas 24.19 and 24.20. 
We make 


Remark 24.23. We compare M as in (24.31) and (24.32) and \ as in (24.45). We 
easily obtain that 


<M. (24.49) 


As a result we have that (24.48) is sharper than (24.34) and (24.47) is sharper than 
(24.36). That is reasonable since we restricted ourselves to a subclass of M(c, d) of 
measures ys by assuming the moment condition (24.46). 


We finish chapter with 


Remark 24.24. I) When c = a and d = 6b then d; plays no role in the best 
upper bounds we found with the Geometric Moment Theory Method. That is, the 
restriction on measures p via the first moment d, has no effect in producing sharper 
estimates as it happens when a < c < d< _b. More precisely we notice that: 


1) 


R.HLS.(24.19) = Lee (b— a) = R.HS.(24.9), (24.50) 
2) by (24.45) here A = (b—a)"*! and 
— IF hoo n 
RHS.(24.47) = 7 Fy (b— a)" = RWS (24.38). (24.51) 


II) Further differences of general means over any [ci,d1] and [c2,d2] subsets of 
[a,b] (even disjoint) with respect to f41 and jue, respectively, can be found by the 
above results and triangle inequality as a straightforward application, etc. 
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Chapter 25 


Gruss Type Inequalities Using the 
Stieltjes Integral 


In this chapter we establish sharp inequalities and close to sharp, for the Chebychev 
functional with respect to the Stieltjes integral. We give applications to Probability. 
The estimates here are with respect to moduli of continuity of the involved functions. 
This treatment relies on [55]. 


25.1 Motivation 


The main inspiration here comes from 


Theorem 25.1.(G. Griiss, 1935, [205]) Let f,g integrable functions from [a,b] > R, 
such that m < f(x) < M, p< g(x) <0, V «€ [a,b], where m,M,p,0 € R. Then 


—_ ‘ " fla)gla)ae - aE ( | jor) ( | “ae\t) 


1 
The constant A is the best, so inequality (25.1) is sharp. 


Other very important motivation comes from [91], p.245 and S. Dragomir ar- 
ticles [155], [154], [162]. The main feature here is that Griiss type inequalities are 
presented with respect to Stieljes integral and the modulus of continuity. 


25.2 Main Results 


We mention 


Definition 25.2. (see also [317], p.55) Let f : [a,b] > R be a continuous function. 
We denote by 


wi(f,d):= sup |f(z)—fly)|, 0<d6<b-a, (25.2) 
x,y€ [a,b] 
|a—y| <6 


the first modulus of continuity of f. 
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Terms and Assumptions. From now on in this chapter we consider continuous 
functions f,g : [a,b] — R, a ¥ b, a is a function of bounded variation from |[a, b] 
into R, such that a(a) 4 a(b). We denote by a* the total variation function of a. 
Clearly here a*(b) > 0. 

We denote by 


b 
D(f,9) = —-——~ | f(@)a(2)da(a) 
ale) — ata) J, 


b b 
SD ESTOIE (/ Hyio()) (/ ne\do()) (25.3) 


this is Chebychev functional for Stieltjes integral. 


We derive 
Theorem 25.3. It holds 
i) 
b b 
ID( fr g)| < sau caor | ( : f(y) — F(x)\lg@y) - noo") da* (x), 
(25.4) 
and 
b b 
ID(f.9)I < WG aor i. ( | wi(f, |e — yl)wr(g, fe — ve") da (x). 
(25.5) 


Inequalities (25.4) and (25.5) are sharp, in fact attained, when f(x) = g(x) = 
id(a) := x, V x € [a,b] and a is increasing (i.e. da* = da). 
it) Let m< f(x) <M, p<g(x)<0,V vé€ [a,b]. Then 


DUD <5 (EOS) r= mylo—p). (25.6) 
When a is increasing it holds 
IDF. 9)| < 5(M — m)(o ~p) (25.7) 


We continue with 


Theorem 25.4. Assume f,g be of Lipschitz type of the form wi(f,6) < 116%, 
wi(g,6) < L26°,0< 6<1,11,L2>0,V6>0. 

Then 

i) 


Ii Ly y : 284,* . 
Dial < aa ( i ( [ @- veo w) do w). (25.8) 
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If a is increasing then again in (25.8) we have da* = da. 
Additionally suppose that a is Lipschitz type, i.e. J A > 0 such that 


la(x) — a(y)| < Ale — yl, V x,y € [a, 4). 


Then 


2 = 2(1+8) 
D(f.9)| < 5 en (25.9) 


(1+ B)(1 + 28)(a(b) — a(a))?” 
Next we have 


Theorem 25.5. For0 <6 <b—a tt holds 


1 faa Pca ee : 
Dol aaa | ( [|=] w) da ) wi (f,5)01(9,9). 
(25.10) 


When a is increasing then again da* = da. 
We derive 


Theorem 25.6. Jt holds 
i) 


<2( in) (stn (Loven) ar) 


(25.12) 


We also have 


Theorem 25.7. Here a is increasing. Then 


i) 


b b 1/2 
ID(f,9)| < 2w1 Go (/ (/ i= want) ios) 
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1 b b : 1/2 
XW ( A= aa) (/ (/ (a — y) a) io) , (25.13) 
and 
ii) 


i ; 1/2 
ID(f, f)| < 2w? ( hoa (/ (/ (a — vant) io) . (25.14) 


Proof of Theorems 25.3 - 25.7. Since f,g are continuous functions and a is of 
bounded variation from [a,b] — R, by [91], p.245 we have 


b b b 
K = K(f,9) = (a(0)—a(a)) / jeans) ( | Hieo() ( / soda) 


b b 
5 i ( | (f(y) — F(2))(o) — ae)aty da(a). (25.15) 


Hence by (25.15) we derive that 


1 b 
Ki< | 
2 Ja a 


b 
J (fo) = Fe) (alu) - @))aa{y)| da 
<i ( [ie seo - tie") da”). (25.16) 


So we find 


Kif.ol <= 5 f (fw f(y) — Fl@)lla(w) ~ alia”) ) (2) = (25.17) 


proving (25.4). 
By m < f(x) < M, p< g(x) <a, all x € [a,b], we obtain 


If(z) — fl < M—™m, 


I(x) — gy)|So—p, V z,y€ [a, }). 


Consequently, we see that 


(M — m)(o — p)(a*(b))?, (25.18) 


IK (f.9)1 < 5(a*(0))?(M ~ m)(o ~ p), (25.19) 


proving (25.6). 
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That is, for a increasing it holds 


(a(b) = a(a))? 
2 


K(f, 9)| S (M —m)(o — p), (25.20) 


proving (25.7). 
Let now f = g = id, id(x) := x, V x € [a,b] and a increasing, then by (25.17) 
we have 


b (pb 
|K (id, id)| = K(id,id) = >| (/ (y — “aay da(x) 


b b 
.. al (/ lay ado") da*(x), (25.21) 


by a*(x) = a(x) — a(a). 
That is proving inequality (25.17) is attained, that is a sharp inequality. 
We further derive 


b b 
m5 [ (fertile verte wer) doe (25.22) 


That is, 


b b 
wecnanl <§ f° ( fantle~ wheal te ata) dC) =n, (2528) 


proving (25.5). 

Since w1(id,d) = 6, V 6: 0 < 6 < b—a, the last inequality (25.23) is attained 
and sharp as (25.17) above. 

Let now f, g be Lipschitz type of the form 


wi(f,d) < 116%, 0<6<1, 


wi(g,6) < L268, L1,L2>0. 
Then 


Ly1L2 : ‘ B BAA * * 
pros 5 la — y|P|a — y|?da*(y) | da*(a) |. (25.24) 
That is, 


b b 
IK(f, 9) < a (/ (/ (a — vio") ior) =: pz, (25.25) 


proving (25.8). 
In the case of a: [a,b] > R being Lipschitz, i.e, there exists \ > 0 such that 


la(x) — a(y)| < Ale — yl, Ve,y € [a, B]. 
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We have for say x > y that 
a*(x) — a*(y) = Valy, 2] < A(x — y), 


ja*(a) —a*(y)| < Ala— yl, V e,y € [a9], 


hence a* is same type of Lipschitz function. 
Thus, see (25.25), we derive 


2 b b 
p3 < nen (/ (/ ee vai) i) (25.26) 


fs [1 LX7(b = a)2(8+1) 


Hence it holds 
Lila (b — a)?(8+1) 


KYO S 2(6+1)(26 +1) 


proving (25.9). 
Next noticing, (0< 6 <b—a), 


ani f (fz (=) or (EH) torn) aor 


(by [317], p.55) (here [-] is the ceiling of number) 


= i" (/ aa) ior) (25.29) 
ncisaiad <3] (f(1+ 22) dora) dre] esa 


wi(f, d)wilg, 6) = 


1 b b (a — y)? 2\x — y| . . 
: / (/ aoe ae ae a ) da ) wil, 8) (9,9) 
To stccnan thie y ae an : 
3 [i (b)) +3 (/ (/ (a — y)*da w) da o») 
b b 
4G (/ (/ oe") ) da (x ))] i(f, )w1 (9, 9) (25.31) 
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b b 1/2 
45 (/ (/ (a = y)°da* 0) ior) a* 0] wil(f, d)w1(g, 0) (25.32) 


(letting 


b b 1/2 
sees a0 (/ (/ om waa") dat o») 0, (25.33) 
here a*(b) > 0,0 < 6* < b—a) 


= lla" (o))? + (a*(b))? + 2(a*(b))"] 
wi(f, 5" )wr(g, 0") = 2(a*(b))?wr (Ff, O* wr (9, 5"). (25.34) 


That is, we have proved (25.10) and 


* 2 1 : : 2 * * - 
K(f. 9)] < 2(0"(0) ar (ist ( | ( [ @-wiaa w) da 9) 
1 ? : 2 * * i 
XW (ot (/ (/ (x — y)"da ) da 9) ; (25.35) 


establishing (25.11). 
In particular, we get 


K(f, A) = 


b b 1/2 
< 2(a"(0) Pu? (isin (/ (/ ir- waar) ior) } (25.36) 


proving (25.12). 
In case of a being increasing we obtain, respectively, 


|K(f,9)| < 2(a(b) — a(a))? 


1 b b ; 1/2 
Wy [ a(b) — ala) (/ (/ (x—y) ‘ow io) 
1 b b : 1/2 
XW ( a) ata) (/ (/ (a — y) ‘ow io) ' (25.37) 


proving (25.13), and 


b b 1/2 
KCL, )1<2(a(6)—a(a))2o} (aca ( / ( i (yaa) dot) ) 


(25.38) 


proving (25.14). 
We have established all claims. 
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25.3 Applications 


Let (Q,.A, P) be a probability space, X : Q — [a,b], a.e., be a random variable with 
distribution function F. 


Then here 
b b 
Dx(f,9): a= f f(a)g(@)dF («) - ( ‘| joie) ( ‘| aeyir(e)) 
(25.39) 
Dx(f,9) = EUf(X)a(X)) — E(f(X)E(G(X)). (25.40) 


By [91], pp. 245-246, 
if f, g both are increasing, or both are decreasing over [a, b], while 


Dx(f,9) <0 


if f is increasing and g is decreasing, or f is decreasing and g is increasing over 
(a, b. 


We give 
Corollary 25.8. It holds 

i) 

IDx(f.9)| < <3f ( [uw fv) — F@)llov) - aoae) dF(«), (25.41) 
and 


b b 
x(t. 5 f ( | o(sle= wate = sae) dP(x). (25.42) 


Inequalities (25.41), (25.42) are sharp. 
ti) Letm < f(x) < M, p< g(x) <o,V x€ [a,b]. Then 


IDx(F.9)| < 5(M —m)(o~ p). (25.43) 


Proof. By Theorem 25.3. 


Corollary 25.9. Assume f,g : wi(f,d) < 116%, wi(g,d) < L268, 0 < 6 <1, 
Iy,L,>0,V6>0. Then 


i) 
b b 
Dx(f.g)| << ( | ( | a) arte) , (25.44) 
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Additionally suppose that F' is such that I X > 0 with 
|F (x) — F(y)| < Ala —y|, V vy € [a, 2). 
Then 
ii) 
[1 L2d7(b = a)2(+8) 


IPM FI S SS +28) 


(25.45) 


Proof. By Theorem 25.4. 


Corollary 25.10. For 0 < 6 < b—a it holds 


i (/ [4] aw) dF (x) 


Proof. By Theorem 25.5. 


IDx(F.9I<5 wi(f,d)ar(9,5). (25.46) 


Corollary 25.11. It holds 


i) 
b b 1/2 
IDx(f.9)| <2un | f, (/ (/ wr war) irs) 
b b 1/2 
xwi lg, (/ (/ ir- wart) arte) ; (25.47) 


and 


|Dx(f, f)| = |E(F*(X)) — (Ef(X))?| = Var(f(X)) (25.48) 


b b 1/2 
90? | Ff, (/ (/ (x — vary) ir.) , (25.49) 


Proof. By Theorem 25.7. 
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Chapter 26 


Chebyshev-Gruss Type and Difference of 
Integral Means 
Inequalities Using the Stieltjes Integral 


In this chapter we of the establish tight Chebychev-Griiss type and comparison of 
integral means inequalities involving the Stieltjes integral. The estimates are with 
respect to || - ||p, 1 <p < oo. At the end of the chapter we provide applications 
to Probability and comparison of means for specific weights. We give also a side 
result regarding the integral annihilating property of generalized Peano kernel. This 
treatment follows [51]. 


26.1 Background 


The result that inspired the most this chapter follows. 


Theorem 26.1. (Chebychev, 1882, [129]) Let f,g : [a,b] — R absolutely continuous 
functions. If f’,g' € Loo([a, }]), then 


— - : f(a) g(a)dx — IE (sem) [ [see] | 


1 2 / / 
< —(b—-— : 
< 550-0)" IF'leolld' llc 


Next we mention another great motivational result. 


Theorem 26.2. (G. Griiss, 1935, [205]) Let f,g integrable functions from [a,b] > 
R, such that m < f(a) < M, p< g(a) <o, for all x € [a,b], where m, M, p,o ER. 
Then 


— [ seat - IE (ner) [ [se] 


Also, we were inspired a lot by the important and interesting article by 5.S. 
Dragomir [162]. 
From [32] we use a lot in the proofs the next result. 
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Theorem 26.3. Let f : [a,b] — R be differentiable on [a,b]. The derivative 
f' : [a,b] — R is integrable on [a,b]. Let g : [a,b] > R be of bounded variation and 
such that g(a) 4 g(b). Let x € [a,b]. We define 


g(t) — g(a) 
a0) — g(a) = = ™ 
P(g(x), g(t)) = mest (26.1) 
g(t) — 9 
a6) g(a)’ <=? 
Then 
1 b ‘ 
f(z) = Goat) f(t)dg(t) + : P(g(2), g(t) f (dt. (26.2) 


26.2 Main Results 


We present the first main result of Chebyshev-Griiss type. 


Theorem 26.4. Let fi, fo : [a,b] > R be differentiable on [a,b]. The derivatives 
fi, f% : [a,b] — R are integrable on [a,b]. Let g : [a,b] — R be of bounded variation 
and such that g(a) 4 g(b). Here P is as in (26.1). 

Denote by 


1 b 
0= pay | A@Alwds(a) 


1 b 1 b 
= (ata! f(s) (ata! float) (26.3) 


where g* is the total variation function of g, i.e. g*(x) := V,la,z], for x € (a,)), 
where g*(a) :=0. 
1) If fis fa € Loo((a, 4]), then 


|A| < 


Ta) — gay ll filloellfalloo + I Fallool Fille 


b b 
<(f (fie.atenlas) aoe) <a, (26.4) 


2) Let p,q > 1: =1. Suppose fi, fs € Lp([a,6]). Then 


= Dg(b) — lay] Falell Falloo + || Fi llpll fellec] 


b 
x ( \Pt2). a(t") = Mo. (26.5) 
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When p = q = 2, then 
1 


b 
x (/ \Pt2). a(t") =: M3. (26.6) 
3) Also we have 


1 / / 
|4| < Fob) — olay) Falla Falloo + [Ifillall felloo] 


«(fie |P(g £))|loo,2dg “)) =: M4. (26.7) 


Proof. By Theorem 26.3 we have 


file coca | fila)dg(x +f P(g(2),9(2)) fi (a)ae, 
and 
1 : ; 
f= oy | faloddate) + | Plol=).ale))s5(e)ae 
all z € [a, 6]. 


Consequently we get 


— __ fale) : x Zz ; z), g(x)) fi (x)dx 
Ale)ale) = BD [ ptwrdala) + fale) [ Pole). ote si(e)dr, (26.8) 


and 
Si (2) fo(2) 


all z € [a, 6]. 
Hence by integrating the last two identities we obtain 


b 1 b b 
| Aloraleiate) = A ( | Atea()) ( fateh) 


b b 
= i | fala)dg(z2) + fala) | P(g(z), 9(2)) f3(x)dx, (26.9) 
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b b 
£ / PO ( , Plo). fle dg(2). (26.11) 


Therefore we find 


b b 
=| filz) (/ Plo().t2) sled] dg(z). (26.12) 


So we derive that 


b b 
+ / fol2) ( , Plo().ata) stoi] 7) : (26.13) 


1) Estimate with respect to || - ||... We observe that 


1 
il $ S50 — ata 


b b 
x ( / ( / Plate). ott) wo") ; (26.14) 


where R.H.S. (26.14) is finite and makes sense. That is proving the claim (26.4). 


[Il fullooll falloo + II fallooll fille] 


2) Estimate with respect to ||: ||p, p,q > 1 such that — + — 
P @q 
We do have 


i / : * 
Il < Halal ist f fi(2)IP), 9(@)lawdg" (2) 


b 
+llfile f |fo(z)IIIP(9(2), 9(2) lla wd" (2) (26.15) 
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1 


b 
= Bg(b) —glay Fall Falloo + IIfillell felloo] (/ |P(9(z), 9(2)Ilq.2dg" (2) 


(26.16) 
where R.H.S. (26.16) is finite and makes sense. That is proving the claim (26.5). 
3) Estimate with respect to || - ||,. We see that 


i / ; * 
I < alo sn f If (2) PCa), 9(@))lloo,ndo" (2) 


b 
+f f (P(t) (26.17) 


1 : , b ; 
< Bg(b) — lay) ll Fallall Falloo + | fillallfelloo] (/ \|P(9(2), 9(2))|loo,2d9 ) ; 


g(a)| 
(26.18) 
where R.H.S. (26.18) is finite and makes sense. That is proving the claim (26.7). 
Justification of the existence of the integrals in R.H.S. of (26.14), (26.16), (26.18). 


i) We see that 
= |P(g x))|dx 


ay |g(x) — sacha lg(x) — g(b)|dx =: I,(z) + L(z). (26.19) 


Both integrals I, (z), I2(z) exist as Riemann integrals of bounded variation func- 
tions which are bounded functions. Thus ,(z), I2(z) are continuous functions in 
z € [a,b], hence I(z) is continuous function in z. Consequently 


[ (fire P(g hati) a “= [102 (26.20) 


is an existing Riemann-Stieltjes integral. 
ii) We notice that (q > 1) 


1/q 
I,(z) -= ||P(9(2), 9(@) lac = ( [re |P(g haha) 


y 1/q 
/ \g(a) — ataitae+ [la \g(x ‘and = [Tug(z) + Log(z)]'/2. (26.21) 


Again both integrals [,,4(z), I2,q(z) exist as Riemann integrals of bounded vari- 
ation functions which are bounded functions. 

Clearly here by g being a function of bounded variation, |g(x)—g(a)|, |g(x)—g(6)| 
are of bounded variation, and |g() — g(a)|?, |g(x) — g(b)|? are of bounded variation 
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too. Thus [1,4(z), [2,q(z) are continuous functions in z € [a,b], hence I,(z) is a 


continuous function in z. Finally 
b 


b 
: IP). 9(@)llaadg* (2) = / Iy(2)dg"(2) (26.22) 


a 
is an existing Riemann-Stieltjes integral. 


iii) We observe that 
A(z) = ||P(9(z), 9(2)) ll co, ne [a,b] (26.23) 
= max{||9(z) — 9(@)|loo,e[a,z} Il9(2) — 9(8)lloo,we[z,b) } (26.24) 
= max{||h1(2)||o0,c€[a,z}> |ha(x)|loo,xe[z,b] } = max{Aj(z), Ao(z)}, (26.25) 
for any z € [a, }]. 
Here A, is increasing in z and Ag is decreasing in z. That is, A;, Az are functions 


of bounded variation, hence h(z) is a function of bounded variation. 
Consequently 


b b 
i IP(9(2),9(2) loo.edg"(2) = i h(z)dg*(z) (26.26) 


is an existing Lebesgue-Stieltjes integral. 
The proof of the theorem is now complete. 
We give 


Corollary 26.5. (1) All as in the basic assumptions and terms of Theorem 26.4. 
Tf fi, fa € Loo([a, bl), then 
|0| < min{ M1, Mo, Ms, M4}. (26.27) 


(2) All as in the basic assumptions of Theorem 26.4 with g being increasing. 
Then 


1 ; / 
< Fp) — play allecll fall + WPallooll Fill] 


b 
x ( ral) ~ ala) ~ a2) (26.28) 


Proof. By Theorem 26.4. 

Next we present a general comparison of averages main result. 
Theorem 26.6. Let f : [a,b] — R be differentiable on [a,b]. The derivative 
f’ : [a,b] — R ts integrable on [a,b]. Let gi, 92 : [a,b] > R be of bounded variation 
and such that gi(a) 4 gi(b), go(a) 4 go(b). Let x € [a,b]. We define 

gilt) — gi(@) 
gi(b) — gi(a) 
P(gi(x), gi(t)) = (26.29) 
so<t<b, 


,ast<a@, 


gi(b) — gi(a) 
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fori =1,2. 
Let [c,d] C |a, b]. 
We define 


1 a if ss 
= Gay | £2 - Gay [FO 8.80) 


Here g3 is the total variation of go. 
Then 
i) If f’ € Leo([a, b]), it holds 


1 d b . ‘ = 
al< atm | | ( : Plas.) toto) I flloc =: Ex. 


(26.31) 
1 1 
ti) Let p,q > 1: oo aoe 1. Suppose f' € L,([a,b]). Then 


d 
| [Vreven Haein IIF’llp,ta,b) =: E2- 
(26.32) 


1 
Ce nO 


When p = q = 2, it holds 


d 
| i \Pia(.m(laria eile) If’ ll2,fa,o) =: Es. 
(26.33) 


1 
P| =a) 


iti) Also we have 


1 d 
|A| < PACER (/ IPC. a (aie) IIf’Il1,ta,o) =! Es. 
(26.34) 
Proof. We have by Theorem 26.3 that 
1 b 
0) = Boy maay f, Loan 
b 

+ P(gi(x), gi(t)) f’ (t)dt, V x € [a,b]. (26.35) 


Here [c, d] C [a,b] and by integrating the last (26.35) we obtain 
[P steraoate) = (29=28) F peoanuey 


d b 
ay | ( i aeianel dga(2). (26.36) 
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1 ; 1 ° 
A:= wars | f(a)dgo(x) — apo f(t)dgi(t) 


d b 
oe! i ( if Ploo) nes") ios). 


Next we estimate A. 
1) Estimate with respect to |] - ||co. 
By (26.37) we find 


1 ! 
Als gy sa (if (/ IP(n(2),r(¢ pia) dgS (a = 
+ 


2) Estimate with respect to || - ||p, here p,q > 1: 
We have by (26.37) that 


1 
eS EO ae 


When p = q = 2 we derive 


d 
/ (Pla. 1(? Naan) IIf'llp,ta,b1- 


1 
eC =raG] 


3) Estimate with respect to |] - ||1. 
We observe by (26.37) that 


d 
/ (Pla). ()lanaaeste) II F’ll2,ta,0)- 


1 
|A| < 
\go(d 


lesa) a0) (f IP (g1(2), 91(€)) lloo,t, [a,b] E99 (© ) If’Il1,[0,0)- 


Similarly as in the proof of Theorem 26.4, we establish that 


d b 
: (/ Plato.) dg (a) 


is an existing Riemann-Stieltjes integral. Also 


d 
i IP(o1(2), 91 NM ase tayo (2) 


is an existing Riemann-Stieltjes integral. 


Finally 


d 
i P(91(2), 91(¢))Ilco,t,021495 (2) 


is an existing Lebesgue-Stieltjes integral. 
Therefore the upper bounds (26.38), (26.39), (26.40), (26.41) make sense and 
exist. 


(26.37) 


(26.38) 


(26.39) 


(26.40) 


(26.41) 


(26.42) 


(26.43) 


(26.44) 
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The proof of the theorem is now complete. 
We give 


Corollary 26.7. All as in the basic assumptions and terms of Theorem 26.6. 
If f' € Loo([a, b]), then 


|A| < min{ Ey, Eo, E3, Es}. (26.45) 


Proof. By Theorem 26.6. 
We also give 


Corollary 26.8. All as in the basic assumptions of Theorem 26.6 with gi, go being 
increasing. Then 


|A| << ————__~ an (f max(gi() — gi(a), gi(0) ~ ante) Il FIl1,[a,6)- 


(26.46) 


(g2(d) — ga(c)) 


Proof. By (26.34). 
We make on Theorem 26.6 the following 


Remark 26.9. When d| = [a,b] we make the next comments. In that case 


* 1 
A et CNET AC) =a sf a )dgo(t) -catas | f(t)dgi(t) 


-GU—aw “f (/ P(gi(2), 91 rea) tote), (26.47) 


and the results of Theorem 26.6 can be modified accordingly. 
Alternatively, by Theorem 26.3 we have 


fe) = yay <f 10 (t)dgo() + f Plow go(t)) fat, Ve € [a, B]. 
(26.48) 

Thus 

b b 

[ serante) = (AGES) [ seodnte 
b b 
ae / ( | Plato). | dgi(x), (26.49) 
and 


1 b 1 b 
Gann | £40 - Say [Hdl 
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: / 
~ (gi(b) — gi(a)) ao O P(g2(2), go(t))f ia) in()). (26.50) 


Finally we derive that 


Ts Son s(/ [ [rate ,ga(t roa) ints). (26.51) 


Reasoning as in Theorem 26.6 we have established 


Theorem 26.10. Here all as in Theorem 26.6. 
Case [c,d] = : b]. We define 


A* = cua | f ()dga(t) coum | 1 f(Q)dgi(t). (26.52) 
— gala — gia 

Here gj 1s oe total variation function g1- 

Then 


i) If f’ € Loo([a, bl), it holds 


P 1 , B * ’ = 
IA <a | iz ( i: Plate. sate) tate) Iflloc =: Fi. 


(26.53) 
1 1 
ti) Let p,q > 1: aD ‘ =1. Suppose f' € Lp([a,b]). Then 


; 1 
= Ga) 


b 
/ (Poe. ol Dart) IIF’llp,fa,0) =! He- 


(26.54) 
When p = q = 2, it holds 


1 


eNO = ae 


b 
) (Plo. sl Manoa) Il F'll2,{0,6] =: Hs. 


(26.55) 
iti) Also we have 


1 b 
|A*| < rOErIG (/ |Past) Ifa fa,o) =! Ha. 
(26.56) 


We give 


Corollary 26.11. Here all as in Theorem 26.6. 
Case {c, d] = [a,b], A* as in (26.52). Then 
i) If f’ € Loo([a, b]), it holds 


|A*| < min{ E,, Hy}, (26.57) 
where Ey as in (26.31), Hy as in (26.58). 
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1 1 
a) Let p,q >1: -+—=1. Suppose f' € L,([a,b]). Then 
P @q 


|A*| < min{ F2, Ho}, 
where E2 as in (26.32), Hz as in (26.54). 
When p = q = 2, it holds 
|A*| < min{E3, H3}, 
where E3 as in (26.33), Hz as in (26.55). 
iii) Also we have 
|A*| < min{ Fy, H4}, 
where Ey as in (26.84), Ha as in (26.56). 
Proof. By Theorems 26.6, 26.10. 
We continue with 


Remark 26.12. Motivated by Remark 26.9 we expand as follows. 
Let here f € C([{a, 6]), 9,91, 92 of bounded variation on [a, J. 
We put 

g(t) —gla),a<t<a, 
P*(g(x), g(t) = 
g(t) — g(b), e<t<b, 
x € |[a, b]. 
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(26.58) 


(26.59) 


(26.60) 


(26.61) 


Similarly are defined P*(g;(x), g:(t)), «7 = 1,2. Using integration by parts we 


observe that 


fo - saya = sto - 9a]. - fF s4a910 


and 


Adding (26.62), (26.63) we get 


x b b 
J ow -sayare + fo -9@)ar& = Fe) - 9) - | 


That is 


(26.62) 
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Hence 


b b 
/ (/ Pale) oar) dg(x) 


b b 
. ( / sia) (a(b) — g(a)) - ( if revo) (a(8) — g(a) =0, 


that is proving 


b b 
/ ( > Pate) otyar) dg(«) = 0. 


(26.66) 


(26.67) 


If f € T([a, bl) := {f € C([a, b]) : f’(x) exists and is finite for all but a countable 
set of x in (a,b) and that f’ € Ly([a,b])} or f € AC({a, b]) (absolutely continuous), 


then 


b b 
/ ( i Pate.) F¢0a) d(x) = 0. 


Additionally if g = id, then 


: ([ Poor) ey 
If g = f = id, then 
a (/ Presa die AU; 


t-—a,a<xt<az 
P*(a,t) = 


t—ba<t<b. 


where 


Equality (26.70) can be proved directly by plugging in (26.71). 
Similarly to (26.65) we have 


b b 
| Pa). a)ate) = F@(910) - na) - f f(t)dgi(t), 


and 


b b 
/ P*(go(at), g2(8)) df(t) = F(e)(92(0) — 92a) - ‘| f(#)dg2(2). 


(26.68) 


(26.69) 


(26.70) 


(26.71) 


(26.72) 


(26.73) 


Integrating (26.72) with respect to gz and (26.73) with respect to g; we have 


b b 
ih (/ Pa (ohn(eyar) dg2(x) 
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b b 
= (/ Heat) (91(6) — g1(a)) — (/ ro) (92(b) — g2(a)), (26.74) 
and 
b b 
/ (/ P* ato) oatt)v)) dgi (x) 


b b 
. ( 7 Hen) ott) oo) ~( / ra) (g1(b) — gi(a)). (26.75) 


By adding (26.74) and (26.75) we derive the interesting identity 


b b 
if (/ Pato) one)are)) dg2(«) 
b b 
+f (/ P* ato) oat) )) dgi(x) = 0. (26.76) 


If f € T([a,b]) or f € A C([a, d]), then 


b b 
iy ( / Pa tohon@)s' oa) ise 
+f (/ P* (go(), ga(t roa) dgi(x) = 0. (26.77) 


Next consider h € L,({a,6]) and define 
H (a) =| h(t)dt, ie. H(a) =0. (26.78) 


Then by [312], Theorem 9, p.103 and Theorem 13, p.106 we get that H’(x) = 
h(a) a.e. in [a,b] and H is an absolutely continuous function. That is H is contin- 
uous. Therefore by (26.67) and (26.68) we have 


‘i (a P*( pat) at = 0. (26.79) 


Also by (26.76) and (26.77) we derive 


oe 
+f (fr gala), galt pee dgi(x) = 0. (26.80) 


From the above we have established the following result 
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Theorem 26.13. Consider h € Li({a,b]), and g, 91,92 of bounded variation on 
[a, b]. Put 


oe —g(a),a<t<zg, 
(26.81) 


V a € [a,b]. 
Similarly define P*(gi(x), gi(t)), i= 1,2. Then 


[ (ew P* (g(x), g(t))h (0)mo) dg(x) = 0, (26.82) 


b b 
‘i ( if amead pate) 
b b 
+ i ( i P* gta) salt)at) dg (a) = 0. (26.83) 


Clearly, for g1 = 92 = g identity (26.83) implies identity (26.82). 


and 


We hope to explore more the merits of Theorem 26.13 elsewhere. 


26.3 Applications 

1) Let (0,.A,P) be a probability space, X : Q — [a, }], a.e., be a random variable 
with distribution function F’. Let fi, fo : [a,b] — R be differentiable on [a, 6], fy, 4 
are integrable on [a,b]. Here P as in (26.1). Denote by 


b 
Ox (fis fa) == i ful) fol) dF (x) 


-( fn f(o)dF (a 9) ( [m0 folo)d F(a i), (26.84) 


That is, 
Ox (fi, fo) = E( A(X) fa(X)) — BUA (X)) E(fo(X)). (26.85) 
In particular we have 
Ox (fi, fi) = E(f7(X)) — (E(fi(X)))? = Var(fi(X)) > 0. (26.86) 


Then by Theorem 26.4 we derive 


Corollary 26.14. It holds 


|x (fi, f2)| S sll fallecll lle + [I fallooll fille] 
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<(f ( [rer |P(F Foap\te) 4 F(z ) =e My. 
it) Let p,q > 1: + ; =1. Suppose fi, ff € Lp({a, b]). Then 


|Ax(fi, fo)| S S[llfallpll fille + II fillpll falloc] 


1 
2 


<(f ||P(F L))||q,20F (z 2) = : Mo. 


When p = q = 2, then 


sll alles + Ill falc 


“(/ IPE), F(a) lnndF (2 )) Ts. 


|9x (fi, f2)| S sll.fslalfallos + Fill Fallen 


(/ | PF F()lnath(2)) =: M4. 


Corollary 26.15. If f, £5 © Loo([a,b]), then 


|Ox (fi, fa)| < 


iii) Also we have 


|Ax (fi, f2)| < min{M), M2, M3, M4}. 


Proof. By Corollary 26.14. 
Next we estimate Var(f(X)). 


Corollary 26.16. It holds 
i) If f' € Loo((a, b]), then 


Var(f(X)) < [Iflleclllloc ( [ ( fv IPF Fate) 4 rte) saa 


1 1 
it) Let p,q > 1: a . =1. Suppose f' € Lp([a,b]). Then 


b 
Var(f(X)) < Iflloll f'llp (/ IPF) Fendt) =: My. 
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(26.87) 


(26.88) 


(26.89) 


(26.90) 


(26.91) 


(26.92) 


(26.93) 
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When p = q = 2, then 


b 
Var(f(X)) < IIflloll f'lle (/ IPC) Feat =: Ms. 


iti) Also we have 


b 
Var(f(X)) < Ilfllooll fh (/ \PU). Fe nath()) =: My. 


Proof. By Corollary 26.14. 
Corollary 26.17. If f’ © Lo([a,6]), then 
Var(f(X)) < min{ M7, My, M3, Mz}. 


Proof. By Corollary 26.16. 


(26.94) 


(26.95) 


(26.96) 


2) Here we apply Theorem 26.6 for gi(x) = e”, g2(x) = 3”, both continuous and 


increasing. 
We have dgi(x) = e*dx and dgo(x) = 37 In3dz. 
Also f : [a,b] — R is differentiable and f’ € L1({a,b]). Define 


ef — e% 
; WastsTg, 
e>—e 
Pere") 4= 
at b 
; ,a<t<b, 
e€ 
and 
3! — 3% 
a ga SE 
P38", 3) )s= 
gt _ b 
a<t<b 
35 _ 3a’ 
Let [c, d] C [a, b] 
Set 


A(e”, 3”) := = Gi asf f(a 2)sde— oy oy RS aa 
We give 


Corollary 26.18. It holds 
i) If f! € Loo((a, b]), then 


gee (In 3)|| f'llo0 
|A(e*,3")| < (e — e4)(34 — 39) 


(26.97) 


(26.98) 


(26.99) 
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1 c 2 
~ (Gop) BAB0- Gey"}In3) 


+e7(1 + 1n3){3° — 37 — 3°ln 3 + 371In3 + (3° — 37) +aln3 —3°cln3 + 3%dIn3} 


+e°(1 + In3){3° — 34 — 3°ln3 + 34In3 + (3° — 3%)bIn3 — 3°cln3 + 34dIn a} 


=; Fy. (26.100) 
1 1 
it) Let p,q > 1: mal 2 =1. Suppose f’ € Lp({a,6]). Ifq¢N, then 


(In 3)1I fllp.fa,0 


|A(e*,3*)| < (ec? — e4)(84 — 35) * 


ae 3” {= Fi [—4, —q, 1— q, e*] + (-1)¢*1 oF, [-4, —q, 1— q, era*)) 
c q 


+nCSC(mq)((—1)4e% — et) } 1/4 dx} 


l| 
ol 


(26.101) 
When p = q = 2, it holds 


(In 3) IF’ ll2,[a,0) 


|A(e*, 3”)| < Vale’ — e%)(34 — 3%) * 


d 
tf 3° ( —4ert# 4 dert+# + e2b(—3 4 2b — 2x) + €74(3 — 2at 22)) is} 


oe (26.102) 
When q,n € N — {1,2}, then 


(In3) fll 10,0) 
(e° = e%) (34 = 3°) es 


{fe {are (a 3 (Ry =) 


|A(e*, 3”)| < 
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iti) Also we have 
(In 3)||f’lltja,e) X ® 


|A(e”,3”)| < (oP — e8)(34 35) =: Bs. (26.104) 
Here ; 
m,ifn(S$") oa 
2 
ev +e? 
®:= ¢ ®@o, ifln ( 5 ) <c (26.105) 
et +e 
Bs, ifesin( 5 ) <a 
where F er 
((3e)* — (3e)%) (In 3) — (3° — 3%)e?(1 + In3) 
Qe 26.1 
: In 3(1 + In3) kee) 
(—(3e)° + (3e)*) In3 + (3° — 3%)e*(1 + In3) 
99 == — 26.1 
: In 3(1 + In3) Penn 
1 a b a b 
@.:= a—(In 2) (In 3)+(In 3) In(e* +e”) b—(In 2)(In 3)+(In 3) (In(e*+e”)) 
3°= Daa pina)” oo 


+((3e)° + (3e)%)(In 3) — 3%e%(1 + In3) — 3%e?(1 + In3)]. 


Proof. By Theorem 26.6 and (26.46). More precisely we have: 
i) If f’ € Loo ([a, 6]), then 


Ae, 8°) < Gras (f (/ P(e" itt) sae) lle 


1 
ii) Let p,q >1: -+-—=1. Suppose f’ € L,([a, b]). Then 
q 


|A(e*, 3°)| IF llp.fa,s): 


In3 ¢ “id P 
a (37 — 3°) |P(e*, e°)lIq,t,[a,03" dx 
When p= q = 2, it holds 
In3 


|A(e*,3*)| < G43 


d 
/ [Pee aii Il f’ll2,fa,0)- 


iii) Also we have 


ln 
|A(e”,3”)| < wo (/" max(e” — e® eae ne) lf’ Ila, rae 


Integrals are calculated mostly by Mathematica 5.2. 
We finally give 
Corollary 26.19. If f’ € L.o([a,}]), then 


|A(e”, 3”)| < min{ Fy, Fa, Es, Eu, Es}. 


Proof. By Corollary 26.18. 


(26.108) 


(26.109) 


(26.110) 


(26.111) 


(26.112) 


(26.113) 


Chapter 27 


An Expansion Formula 


In this chapter Taylor like expansion formula is given. Here the Riemann-Stieltjes 
integral of a function is expanded into a finite sum form which involves the deriva- 
tives of the function evaluated at the right end point of the interval of integra- 
tion. The error of the approximation is given in an integral form involving the nth 
derivative of the function. Implications and applications of the formula follow. This 
treatment relies on [36]. 


27.1 Results 


Here we give the first and main result of the chapter. 


Theorem 27.1. Let go be a Lebesgue integrable and of bounded variation function 
on [a,b], a< b. We set 


q(x) = is go(t)dt,... (27.1) 


Qn(x) = a Fa oot neéN, x € {a,b}. (27.2) 


Let f be such that f"— is a absolutely continuous function on [a,b]. Then 


b n-1 
if fago = S>(-1)¥ f(b) gx(b) — F(a)g0(a) 
k=0 : 
+9 fon als (tat (27.3) 


Proof. We apply integration by parts repeatedly (see [91], p. 195): 
b b 
J £490 = $@)g0(0) - flagola) — f gos'at 
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and 


b b b 
i gof'dt = / fldg: = f'(0)g1(0) - f'(@)gr(a) - / of''dt 


a 


b 
= f'(b)91(b) - / of dt. 


a 


Furthermore 


b b b 
/ f'gdt = / f'"dgo = "(b)g2(b) — f"(a)ga(a) - i oofat 


a 


b 
= f"(®)g2(b) - / gofdt. 


a 


So far we have obtained 


b 
/ fdgo = f(b)g0(b) — f(a)go(a) — f'(B)o1(0) 


b 
+ f""(B)go(b) - : oof" at. 


Similarly we find 


b b b 
/ gof"dt = / fi"dgs = f'"(b)gs(b) — / asf dt. 
That is, 
b 
| fdgo = F(6)go(b) — F(a)go(a) — f'(b)91(0) 


b 


+ f"(b)ga(b) — f(b) 93(b) + i, asf dt. 


a 


The validity of (27.3) is now clear. 
On Theorem 27.1 we have 


Corollary 27.2. Additionally suppose that f™ exists and is bounded. Then 
b n-1 


fdgo — S°(-L) Ff (b)gu(b) + F(a) G0(@) 


a k=0 


| Gn—1(t) f( (t)dt 


b 
< [If Ilec / lgn—1(t)|dt. (27.4) 


As a continuation of Corollary 27.2 we have 


Corollary 27.3. Assuming that go is bounded we derive 
b n-1 


fdgo — S>(-D)* f™ (6) gx(b) + f(a)g0(a) 


¢ k=0 


< IF lcllgollee —E. (27.5) 
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Proof. Here we estimate the right-hand side of inequality (27.4). 
We observe that 


nal =| f° Santa 
(ge g)"-2 
< [SF oteat 
i 
< mee f (x — t)"~*dt 
Mails de i 
~ (n= 1) i 
That is 
Jn-a(t)| $ lool, allt € [a,b 
Consequently, 
b ain 
J gmat < (gol = 


A refinement of (27.5) follows in 


Corollary 27.4. Here the assumptions are as in Corollary 27.3. We additionally 
suppose that 


If Ilo SK, Wn >1, 


i.e., f possess infinitely many derivatives and all are uniformly bounded. Then 


b n—-1 
/ fagy — S>(-1)* f(b) gx(6) + f(a) ge(a) 
a k=0 
< K\gollo P=", vm > 1. (27.6) 


A consequence of (27.6) comes next. 


Corollary 27.5. Same assumptions as in Corollary 27.4. For someQ <r <1 it 
holds that 


O(r") = K||9ollo0 @ a" +0, asn— +00. (27.7) 
That is 
n—-1 b 
lim yt Gon) = f Fdo0 + Fagot). (27.8) 
k=0 a 


At 


ar 7 Oasn— +00. Call 


Proof. Call A := b—a> 0, we want to prove that 
A” 
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Notice that 
A 


=e 
But there exists ng EN: n> A-—I,ie., A<no+1, that is, r:= ah 
take no := [A —1]+1, where [-] is the ceiling of the number). Thus 


In+1 Dis WPS 2 pets 


< 1 (in fact 


Ing+1 = 7C, 


where 
Ei= Fry > 0: 
Therefore 
A ei 
x 25> —Tet i= re rc 
not no +2 not mm 2 , 
i.e. 


2 
Inot2<7T'C. 


Likewise we get 


Tnrot+3 < ree. 
And in general we obtain 
0< anak <r e-cHct-r™t*®, KEN 
where c* := =. That is 
O0<an <cir®, VN>no4+1. 


Since rX 


(b=a)” 
n! 


— 0as N — +c, we derive that rx — 0 as N — +o0. That is, 
— 0, as n > +00. 


Remark 27.6. (On Theorem 27.1) Furthermore we observe that (here f € 
C” (a, 6])) 


b 
/ fdgo + f(a)g0(a) 
27:3) |S) yk (k) ea (n) 
ZS |S* (1) )ge(b) + (-1) ) dn—a(t) f (tat 
k=0 a 


n—-1 b 
< TIF lleclge()| + flac ‘ ln (é)|at. 
k=0 G 


Set 


E=max{||f loa IF lors 1 F lleok (27.9) 
Then 


b 
/ fago + f(a)go(a) 


n-1 b 
< £15 Ini + if inact neN fixed. (27.10) 
k=0 = 
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27.2. Applications 


(I) Let {m}men be a sequence of Borel finite signed measures on [a,b]. Consider 
the functions go,m(X) := Um|a, 2], x € [a,b], m € N, which are of bounded variation 
and Lebesgue integrable. Clearly (here f € C™[a, b]) 


b b 
[fate = fF Fd. + Gamnka)- F(a): (27.11) 


We would like to study the weak convergence of 4m to zero, as m — +00. From 
(27.10) and (27.11) we obtain 


i fits 


Here we suppose that 


n-1 b 
San 2 Igim(b)| +f m-amtoyeeh VmeN. (27.12) 


k=0 


Jo,m(b) aed 0, 


and 
b 
i |Jom(t)|dt + 0, as m— +00. (27.13) 


Let k € N, then 


by 4)k-1 
k.m(b) = / Cinta 


by (27.2). 
Hence 


b — g)k- 
|g%,m(b)| < (/ ant) ear aan 


That is, 
|9k,m(6)| +0, Vk EN, m— too. (27.14) 


Next we have 


= 

Thus 
Jnana) S [PF ao nl) 
g Fa [ bgoon(tat 
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Consequently we obtain 


b —a n-1 b 
[gna mbaide < FF gam (t)at — 0 


as m— +co. That is, 


b 
/ lGn—1,m(t)|dt > 0, as m— +00. (27.15) 


Finally from (27.12), (27.13), (27.14) and (27.15) we derive that 


b 
/ fdum 0, as m— +o. 


That is, 4, converges weakly to zero as m — +00. 

The last result was first proved (case of n = 0) in [216] and then in [219]. 

(II) Formula (27.3) is expected to have applications to Numerical Integration 
and Probability. 

(III) Let [a,b] = [0,1] and go(t) = t. Let f be such that f("-) is a absolutely 
continuous function on [0,1]. Then by Theorem 27.1 we find g,(x) = —— all 
n €N. Furthermore we get 


1 es _ , f®(1) (=1)" i" aes 
[eX 1) (anit je (t)dt. (27.16) 


n! 
k=0 


One can derive other formulas like (27.16) for various basic go’s. 


Chapter 28 


Integration by Parts on the 
Multidimensional Domain 


In this chapter various integrations by parts are presented for very general type 
multivariate functions over boxes in R™, m > 2. These are given under special and 
general boundary conditions, or without these conditions for m = 2,3. Also are 
established other related multivariate integral formulae and results. Multivariate 
integration by parts is hardly touched in the literature, so this is an attempt to set 
the matter into the right perspective. This treatment relies on [34] and is expected 
to have applications in Numerical Analysis and Probability. 


28.1 Results 


We need the following result which by itself has its own merit. 
Theorem 28.1. Let f(11,...,%m) be a Lebesgue integrable function on x, [ai, bi], 
méEN; a; < bj, i=1,2,...,m. Consider 
y 
F(21,%2, pies (Pt) = / f(x1, £2, Irs ,Lm—1,t)dt, 

am 
for allam <y < bm, @ := (01, %2,...,;2m-_—1) € Kassel. 
Then F(21,%2,...,U%m) is a Lebesgue integrable function on x, [aj, bi]. 


Proof. One can write 


y 
PEW =f $@ bat 


Clearly f(%,-) is an integrable function. Let y > yo, then Ve > 0, 4d := d(e) > 0: 
y Yo 
[ tena f f(#,t)dt 


Yo 
/ HP. <e, with |y— yo| <4, 
y 


|F(2’,y) — F(@,yo)| = 


by exercise 6, p. 175, [11]. Therefore F (2, y) is continuous in y. 
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Next we prove that F is a Lebesgue measurable function with respect to 7. 
Because f = f+ — f-, it is enough to establish the above for f(2’,t) > 0, for all 
(a ,t) € x, [a;,b;]. By Theorem 17.7, p. 131, [11], since f is Lebesgue measurable 
there exists a sequence {¢,} of simple functions such that 0 < ¢,(@,t) 7 f(@,t) 
for all (a ,t) € x™,[a:,b;]. Indeed here 0 < $n(@,t) < dnyi(@,t) < f(@,t). 
These are defined as follows: 

For each n let 

At := {(@,t) € x™,[a:, bs] : @—1)2-" < f(@,) < 12} 
for i = 1,2,...,n2", and note that A’. A? = ¢ if i 4 j. Since f is Lebesgue 
measurable all A‘, are Lebesgue measurable sets. Here 


n2” 


ae (i — 1) x4, (@, 4), 


where y stands for the aed ates reacties 
Let AC X x Y in general and x € X, then the x-section A, of A is the subset 
of Y defined by 
Az :={y EY: (x,y) € A}. 
Let 6: X x Y — R be a function in general. Then for fixed « € X we define 
0. (y) := O(a, y) for all ye Y. Let E C X x Y. Then it holds that ye, = (ve)s, 
see p. 266, [312]. Also it holds for A; C X x Y that 


(n a = ( (Ade. 

i€l 2 ier 

Clearly here we have that (¢,)~ 7 fg. Furthermore we obtain 
(On) (t) * Xlam,y] (t) T fe (t) S (een 


Hence by the Monotone convergence theorem we find that 


bf. bm 
i, Canon Oe i Pie nace 


[ [ fa (tat = F(Z, y). 


m 
R22” n2” 


(dn)= (t) = 226 Wage =| i-Uxcaz) (#)- 


Here (A‘,)~ are Lebesgue measurable subsets of hates for all @, see Theorem 


6.1, p. 135, [235]. In particular we have that 


That is, we have 


By linearity we get 


n2” 


ae e-y [ x(aiys (dt + F(R), 
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and 


n2™ 


bm 
ae 1) f” Xena)“ Xtagye Bat + FC 2). 


m 


That is, 
n2” 


bm 
ae (i-1 | X lam yIN(As, halt) 3 dt — F(#,y). 


m 


More precisely we find that 


n2” 


2G — 1)d1((Ah) e A [ams yl) > F(@,y), 


as nm — +00. 
Here A, is the Lebesgue measure on R. Notice that 


((<72q" Iai, b]) x [ams yl)> = = [Qm, Y]- 


Therefore we have 


(A, 9 [x77 "Tai, bi] x fam, yl])-¢ = (An) a 0 [ams 9)- 


Here A‘. n bier le bi] x [am,y]] is a Lebesgue measurable set in x’, [a;, bi]. 
Clearly (A‘,)¢ M [am,y] is a Lebesgue measurable subset of [am,bm]. And by 


Theorem 6.4, p. 143, [235], we have that 
VW(@) = Ar ((A,)e 9 [am,y)) 


is a Lebesgue measurable function dined on x'>"[a;, bi]. Consequently F(a’, y) is 
a Lebesgue measurable function in @ € x7"3'[a;,b;] as the limit of Lebesgue mea- 
surable functions. Clearly we have ital so far that F(2, y) is a Caratheodory 
function. Hence by Theorem 20.15, p. 156, [11], F(a’, y) is a Lebesgue measurable 
function jointly in (#,y) on x”, [ai, bi). 

Next we notice that 


y bm 
r@wl< fle oars | F(@, lat. 


™m m 


And we see that 


bm 
l ne. plae < [ / \f(@,t)|\dt | dv 
x9 | [aids] xml fai,bi] a 


m 


<M<+4+o, M>0, 
by f being Lebesgue integrable on x7”, [a;, bj]. Hence 


bm 
/ (/ Fe ae dy < M(bm — Gm) < +00. 
fe x79 | [ai bi] 


m 


Therefore by Tonelli’s theorem, p. 213, [11] we get that F(21,...,2%m) is a Lebesgue 
integrable function on x7, [ax, bi]. 
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As a related result we give 


Proposition 28.2. Let f(21,...,%m) be a Lebesgue integrable function on 
xi? lai, bi], m EN; a; < bj), 4 =1,2,...,m. Consider 
Lp 
PY G04, Ads Bp gcen ebay) =| f(@1,@2,...,t,...,Lm)dt, 
ar 
for all a, < x,» < bp, r € {1,...,m}. Then F(a1,22,...,%m) is a Lebesgue inte- 


grable function on xi, [a;, bi]. 


Proof. Similar to Theorem 28.1. 


Above Theorem 28.1 and Proposition 28.2 are also valid in a Borel measure 
setting. 
Next we give the first main result. 


Theorem 28.3. Let go be a Lebesgue integrable function on x?,[ai,b:], m € N; 
a; < bj, 7 =1,2,...,m. We put 


GF ieee) al GO (Listy tees oe) ely (28.1) 
Gig Lases Wee Lin) = H Gk—1,r(L1,--+,t,..., Um) dt, (28.2) 
k=1,...,nEN; re {l,...,m} be fired. Let f € C"-1(x™,[ai, bi]) be such that 
oI 
Ls Ee ere ee =0, forj=0,1,...,n—1. (28.3) 
Ox}. 
Additionally suppose that: 
ov lf ; : ; 
aunt is absolutely continuous function 
Lr 
(28.4) 
with respect to the variable x,, and -— is integrable. 
Then J =i 
eo” 
if food® = (-1)” / ne(@) 2a. (285) 
x™ [aids] x7, [as,bi] Oxy! 
Proof. Here we use integration by parts repeatedly and we find: 
br 
F Cig es Bees een Pen OO Ety sony Bey ee cg Bm aap 
b 
if Ogi» 
— f(x1, »Ur, Xin) at (x1, »cr, Lm) dL, 
oe Ox, 
= Fl ge sg Days oh hm) Gi By iv Deg en) 
— f(x, Ory -+ +, 2m) G1,r(L1, ar, Sais) 


br 
— / Gig (Lig oss Creiny lin) lac Cissy aes aep 
a 


by 
= -{ Gi rE ye obras ss Cr) de (Lig iees Cress fon) dae 
a 


Integration by Parts on the Multidimensional Domain 383 


Similarly we have 


b, 
/ Ot (Bis. + Ong eh Vr) fang (LIS Gry as {Lp AL y 
a 


Pe oe - 
sv 
= Pp lias. Cpe Cn) (Bisse yds im aoe 
Gr 


OL, 
= fal Dist 5 Ops ne 0m) Gor (Lis sen gbey tn) 
— fr (21,-.+)@r,++-,Lm)G2,r(£1,---,Or,---,2m) 


bp 
= i Gor (Gig ses Oryesy Om) fase. (LI. is sEr yes 9) Lm) aor 
a 


r 


by 
-| Gat (Gise ne Lapiees Orda Elena tpteay laces 
a 


r 


That is, we find that 
br 
/ F(@1, +++) Bey + +) Bm) Go(F1; +++ Bry ++, Lm) Aer 
ar 


br 
= (-y? f G2,r(@1,---;2r,---, Lm) few, (L1,-+-5Ery---,Lm)dtyp. (28.7) 


r 


Continuing in the same manner we get 


Be ediss Bm JOLE 


b, 
/ f(@1,---,2r,---52m)go(@1,--- 
Ot 


Ox” 


br 
= (-1)" [ Gril Lrg @rsoess lm) (@1,.--,;Ur,-+-;Lm)dx,. (28.8) 


r 


Here by Proposition 28.2 gn,-(%1,...,;@%m) is Lebesgue integrable. Thus integrating 
(28.8) with respect to %,...,%p—1,Ur41,---;@m and using Fubini’s theorem we 
establish (28.5). 


Corollary 28.4 (to Theorem 28.3). Additionally suppose that the assumptions 
(28.3) and (28.4) are true for each r =1,...,m. Then 


ie ~_ foode = ear ae _ In B SLB YAR. (28.9) 


Proof. Clear. 
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Corollary 28.5 (to Corollary 28.4). Additionally assume that ||go|| < oo. Put 


(max. || 557|.)llgolloc 
i Se 28.1 
(n+ 1)! PEt!) 
Then 

M m es m 

H fgod%| < —~ S~ | (b, —a,) [[ @ -4:) . (28.11) 
x 7E1 las sds] sal j=l 
jAr 


Proof. Notice that 


Gri re (Ey yep pees Lay) = 
a 


Thus 
Lr (2p = ire 
[9n,r(@1,-+-Lr,+-+,Lm)| S [ ie Dee Il Gollo0 
(a Wie) 
= ——llgolloo- 
That is, 
lanr(B)| <I" aol, VE € xP la, bi 


Consequently we obtain 


n > 
/ tuo ®) FE aa 
x7 [a:,b:] 


< etl | 


n! 


(a, — ay)"day +--+ dam 
Of" [027 || 0llGolloo : ~ 
_ | f sea (b, = agers II (b; = a;). 


Finally we have 


(= a” f(z) =? - n+l - 
qe (a) aan <M So (br — ar) [[ @ = 4) 

‘ig r=1 g=1 

jAr 


That is establishing (28.11). 
Some interesting observations follow in 


Remark 28.6 (I). Let f,g be Lebesgue integrable functions on x7, [ai, bi], ai < bi, 
i = 1,...,m. Let f,g be coordinatewise absolutely continuous functions. Also 
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all partial derivatives fz,, gz, are suppose to be Lebesgue integrable functions, 
i=1,...,m. Furthermore assume that 


PBI 2005 Gijeery Bm) Hh (1) 0 bis 5m) = 0, (28.12) 


for all i = 1,...,m, ie, f is zero on the boundary of x',[a:,b;]. By applying 
integration by parts we obtain 


bi 
Gu, fda; = g(x1,...,bi,..-,2m)f(a1,...,bi,..-,;2m) 


4 


—g(@1,..-,Qi,. T (Bigcce Gigs Baa) 
-[ fa, gdx; = S, fx, GdXj. 
That is, we have 
bi b; 
i Je, fda, = — / fc; gdxi (28.13) 


for alli=1,...,m 
Integrating (28.13) over 
[ar bi] x ++ x [az, Bi] x ++ X [am Pra; 
(ai, bi] means this entity is absent, and using Fubini’s theorem, we get 


i G2, fd = — / jegax, (28.14) 
X71 lai ,bi] x™ [as ,di] 


i=l 


true for alli = 1,...,m. One consequence is 


/ i (>: o] dz = -| g: (> in] dz. (28.15) 
x71 lai b:] Psa xT  [ai,bi] i=l 


Another consequence: multiply (28.14) by the unit vectors in R™” and add, we 


obtain 
/ f-(Vg)d@ = -| g: (Vf)dz, (28.16) 
xm, [ai,bi] x™ , [ai ,bi] 


true under the above assumptions, where V here stands for the gradient. 

(II) Next we suppose that all the first order partial derivatives of g exist and they 
are coordinatewise absolutely continuous functions. Furthermore we assume that 
all the second order partial derivatives of g are Lebesgue integrable on x7, [aj, by]. 
Clearly one can see that g is as in (I); see Proposition 28.2, etc. We still take f as 
n (I). Here we apply (28.13) for g := gz, to get 


b; 
i (9x, )e ifdxj = -f fee; Ga; dxj. 


a 


And hence by Fubini’s theorem we derive 


/ Gxjx,f@ = — | LeGee es, (28.17) 
x7, [a7 ,di] x71 [ac di] 
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for alla =1,...,m. Consequently we have 


/ fol oye az = -{ Ge (>. in) dz. (28.18) 
xP [ai,bi] j=l x PE [aida] i=l 


Similarly as before we find 
/ f+ (Vo9n,)de = -| Ge (Vidz (28.19) 
fé1 [aa,ba] x, [ai,di] 


for all j =1,...,m 
From (28.13) again, if 7 = j we have 


bi by 
/ Jui, fda; = — i Sai Gx,A2i, (28.20) 
and consequently it holds 
| Guiei, fT@ = -| frGn0e. (28.21) 
x74 [aa d:] x™ | [ai,b:] 


Furthermore by summing up we obtain 


ees: bi] (Sn “| ian eas bi] (= . 1) 


That is, we have established that 


i (V?9) fda’ = -| (>: fe, o da (28.22) 
x7, lai d:] x™ | [ai bi] 


true under the above assumptions, where V? is the Laplacian. If g is harmonic, i.e., 
when V7g = 0 we find that 


ay | agen =0. (28.23) 
i= l@i> 


(III) Next we suppose that all the first order partial derivatives of f exist 
and they are coordinatewise absolutely continuous functions. Furthermore we as- 
sume that all the second order partial derivatives of f are Lebesgue integrable on 
xi@_,[a;,b;]. Clearly one can see that f is as in (I), see Proposition 28.2, etc. 

Here we take g as in (I) and we suppose that 


G(X1, +--+) Qi,---;Bm) = g(41,.--,0;,.--,Lm) = 0, (28.24) 


for allt =1,...,m. Then, by symmetry, we also - that 


xm | [ai ,b:] x™ , [ai ,d:] 


So finally, under the assumptions (28.12), oe and with f,g having all of their 
first order partial derivatives existing and coordinatewise absolutely continuous 
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functions, along with their second order single partial derivatives Lebesgue inte- 
grable on x/”,[a;, b;] (nence f,g are as in (I)), we have proved that 


if (V*Agae = | f(Vgaz =-y> f hoe 
x7, [ai,di] xT [aids] jal 2 Xi [ai,di] 
(28.26) 


From [36] we need the following 


Theorem 28.7. Let go be a Lebesgue integrable and of bounded variation function 
on [a,b], a< b. We call 


glx) = [ go(t)dt,... 


n(x) = [ Faw oot neéN, x € {a,b}. 


Let f be such that f"- is a absolutely continuous function on [a,b]. Then 


b n-1 
i fago = S>(-1)* F(B)gx(6) — F(a)g0(a) 
a k=0 


+(-1)” / gn—1(t) f™ (t)dt. (28.27) 


We use also the next result which motivates the following work, it comes from 
[91], p. 221. 
Theorem 28.8. Let R = {(2,y)|a<a2<b,c<y < d}. Suppose that a is 
a function of bounded variation on [a,b], GB is a function of bounded variation on 
[c,d], and f is continuous on R. If (x,y) € R, define 


n= [ sevddate) =f sewaaty. 


Then F € R(8) on [c,d], G € R(a) on [a,b], and we have 


[rw y= [ow )da(x 


Here R(G), R(a) mean, respectively, that the functions are Riemann-Stieltjes inte- 
grable with respect to 3 and a. That is, we have true 


[ (freer ) as = ff evnrate )) ax. (28.28) 


We state and prove 


Theorem 28.9. Let go be a Lebesgue integrable and of bounded variation function 
on [a,b], a< b. We set 


go(t)dt, ... (28.29) 


Gn(2) = ie EO gota neéN, x € [a,b]. 
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Let ro be a Lebesgue integrable and of bounded variation function on [c,d], c < d. 
We form 


ry) = [ rotwae, ie (28.30) 


rly) = a a —ra(s)ds, neéN, ye [ced. 


Tet f : R = [a, dhe [c,d] — R be a continuous function. We suppose for k = 
0,...,n—1 that fe ie Va ,*) ts an absolutely continuous function on [c,d] for any 


LE ae a,b]. Also we assume for €=0,1,...,n that f(x,y) is continuous on R. 
Then it holds 


[([ renane )) avin 


=F Fy p69 age( Orel a + role a Ett p0)( (.cin 


os £=0 ; 
+ So (-1)**" gx (b) ( i ra—i(y) fe” (b, a 
k=0 ¢ 
d 
+ (-1)"*#1g9(a) / rai (yf (a, vay 
n—-1 b 
+ So (-1)4r(a) ( / dna) f° (cr, ivr) 
£=0 - 


b 
+ (-1)"**ro(e) (/ Gn—1(2) fg”) sa 


d b 
+f (/ An awe, nite a. (28.31) 


Proof. Here first we apply (28.27) for f(-,y), to find 


b n-1 
/ f(a, y)dgo(x) = S>(—1)* (6, y)an() 
a k=0 


b 
~ $(c,¥)go(a) + (—1)” / gn-1(2) F(a, y)de, 
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where f means the kth single partial derivative with respect to x. Therefore 


| ‘ ( if "realdo()) dro(y) 


n 


—1 d d 
S7(-)an(b) / f(b, y)aro(y) — go(a) i fla,vdro(y) 
0 Cc c. 


k= 


d b 
+1" f (/ Gn—1(2) toner) dro(y) =: ®. 


Consequently we obtain by (28.27) that 


@= Thule) {PY HG, drat 


£=0 


d 
— f(b, e)ro(c) + (—1)" / rn) fay” vay} 


n-1 


 go(a){ 


£=0 


d 
IH (a,dreld) ~ Flaeyro(e) + (-" f° ma) APCa ay} 


b 
= ( ore inet) ro(¢) 
(n) 


aoa se ae [raw ( [oer ean) dy \ 


y 


Finally (28.31) is established by the use of Theorem 9-37, p. 219, [91] for dif 
ferentiation under the integral sign. Some integrals in (28.31) exist by Theorem 
28.8. 


As a related result on bivariate integration by parts we give 


Proposition 28.10. Let go and ro be as in Theorem 28.9. Let f : Ro := 
[a,b] x [c,d] — R be a continuous function. We suppose that f(x,-) is a abso- 
lutely continuous function on [c,d] for any x € [a,b]. Also we assume that fr, fry 
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are existing and continuous functions on R. Then 


| : | FGA iney) 


= f(b, d)go(b)ro(d) — f(b, €)g0(b)ro(e) + F(a, €)go(a)ra(e) 
d d 
— f(a,d)go(a)ro(d) + go(a) i, rol) fy(a, ¥)dy — go() y: ro(y) fy(bsu)dy 


b b 
= ro(d) ( / m(ohtte eh) +r9(0) ( i. oleate) 
d b 
ip ib ( | (ena) Fah) dy. (28.32) 


Proof. Apply (28.31) for n = 1, then k = ¢ = 0, etc. 
The next and final result of the chapter deals with integration by parts on R°. 


Theorem 28.11. Let g; be Lebesgue integrable and of bounded variation functions 

On |G, 0s), Gi <b. Dia Ly DiS Ber f A = RES 1 [ai bi] — R be a continuous 
of 2 3 

function. We suppose that fx,, 1 = 1, 2,3; xe. gods oo. os are all 

existing and continuous ee on kK. Then 


= gi(b1) enlba) eC h(is bo, bs) — gi(a1)g2(b2)93(bs) f(a1, b2, bs) 
— g1(b1)92(a2)93(b3)f(b1, a2, b3) + 91(a1)92(a2)93(b3) f(a1, a2, bs) 
( ) 
( 


— gi(b1)92(b2) 93 (as) f(b1, b2, a3) + 91(@1)92(b2) 93 (a3) f(a1, b2, a3) 
— gi(@1)92(a2)93(a@s) f(a1, a2, 43) + 91(b1)92(a2)93(a3)f(b1, a2, a3) 


by 
= galba)on(ba) | gi(x1 es 


Ba Of (#1, G2, b3) 
ay Ox 


dxy 


dx, 


bo 
+ gi(ar)ga(bs) [gal ny) 2 a 


by 
*. galba)ga(as) | gy (a) es) 


" X1,a2,a 
— ga(aa)a(a) [ aay eas) 


ai Ox, 


bo . 
+ gi(bs)ga(as) | ga( ay) 2s) 


dx, 


dx, 


dx2 
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Ox 


Of (bi, bo, x3) 
0x3 


Of (a1, ba, x3) 
0x3 


93(@3) 


93(@3) 


93(@3) 


(as)ga(aa) | 


a3 


0x3 


g1(©1)92(x2) Dad 


gi(@1)93(x3) 


+n ff 


gi(@1)g2(r2) 


Of (a1, X2, a3) 


Of (a1, a2, x3) 
0 f (a1, 2, a3) 
0? f (x1, bo, £3) 


0x3021 
0? f (x1, 2, bs) 


sional Domain 


dx2 


dx3 


dx3 


dx3 
dx2dx1 
Z1 


dx3dx, 


dx2dx, 


Ox202X1 
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Of (a1, a2, £3) 
0x302, 


0? f(b1, 22, £3) 


bo b3 
+ (br) ff gala2)aa(es) Ss 
f(as, 2, x3) 


a2 
bo ty yo 
— gi(a1) pf ok g2(#2)9a(v3) Ox30X2 
by is bs O° f (v1, 22,03) 
91(£1)92(x2)93(x3) =. —— 


023022021 
Similarly one can get such formulae on x'™,[ai, 


g1(21)93 (#3) dx3dx1 


dx 3dx2 
——_——dr3dx2 


————— drzdrodx1. (28.33) 


bi], for any m > 3. 


Proof. Here we apply integration by parts repeatedly, see Theorem 9-6, p. 195, 
[91]. Set 


by bo b3 
i if | f(@1, £2, 3)dgi(x1)dg2(x2)dg3(a3). 


The existence of J and other integrals in (28.33) follows from Theorem 28.8. We 
observe that 


bg 
f (1, 2, %3)dg3(x3) = 


a3 


f(x, v2, b3)93(b3) we 


b3 
[ 


ndCee v2, a3) 93(a3) 


Ofer, 22, x3) 


d. 
0x3 e 


93(x3 
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Thus 


bo b3 
/ ( Heyras)dot) dg2(x2) 


3 


bo b2 
= ga(bs) | f(x1, 22, b3)dgo(x2) — gs(as) f(x1, 22, 43)dgo(xr2) 


a2 


bo bs s 
-| (/ ole) Lett a) dg2(x2) 


be 11,2 
= oa { Fe baba) = Floren neon ~ f (en) Mesa t | 


be 11,22,a 
— g3(a3) {eta — f (x1, a2, a3)g2(a2) -| rlen) Les tt doh 


bs 21, 62,2 bs 21, 02,2x 
- { (/ nfo) Aaa a) g2(b2) — (/ (eo) As ettt ) g2(a2) 


= Bes iO fina 
-| go(a2) Py eek Pree 
a2 a3 Ox30X2 


Therefore we obtain 
by bi 
I= an(ba){ alo) (/ Hebb) (e) — g2(a2) ( Fea bn(o)) 


bi bo x 
f(g) wn 


by bi 
= an(aa){ gab) | f(x1, 62, a3)dgi (x1) — go(a2) f (x1, 42, a3)dgi (a1) 


a1 


b ba %1,0%2,a 
= / (/ olen) Ltt) an(a)} 
by bs i 
= {a2(0) (/ (/ n(o Lett a) iota) 
— g2(az) (/ ace 93(x3 20st ion) 


-[ ‘if g2(x a(f 93 on) Aa) is) inter), 
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Consequently we get 
I= an(b0){ gallo) ce bz, b3)91(b1) — far, ba, b3)g1(a1) 


0x4 


bi 
— f(a1, a2, 63)91(a1) -| in(e,) ee | 


ue X2, 

= (/ olen) Ate r) gi(b1) 
8 a1, %2, 

- ( 1 olen) Lets as.) atte 


by ba 2 
a [ gi(a1) (/ rten) Ht as.) a] 


= sala) {gab fc be, a3)91(b1) — f(a1, b2, a3) 91 (a1) 


b Of (x1, bo, 
_ ‘ in (0) rea, | 
ay 1 


= a g(r a, | — g2(a2) ows bs)gi(b1) 


— g2(a2) ce a2,a3)91(b1) — f(a1, a2, a3)91 (a1) 


bi 
- : (0) Ee) 
ay Ty 


be ite 
= (/ (en) Aste a.) gi(b1) 


| 
7 (s olen) Lett do) (a1) 


2 ba 2 F(a, 2 
~ ftom (fate pagan) a} 


0x3 


by bs fay b 
- fa) (/ (eo) HP a, ae,| 
ay a3 L302, 


bs a2,2 
— g2(a2) (/ role) Aste a) gi(b1) 


7 (/ (eo) A a gi(a1) 


0x3 
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b3 a 
= (/ n(oy Hist an) gi(ai) 


3 


bi bs 2 
- [ater (/ (eo) Hata a ae,| 


3 


ie m8 0? b ’ ’ 
ales [om g2(@2)93 (a3) rsd gi(b1) 


” i: 0? ’ ’ 
-(f. is g2( XQ )g3 (x3) feet) 28) ops di gi(ai) 


b b b 
1 2 3 63 ; , 
2 [ gil £1) (/ a g2( x2 )93 (x3) Pfs tt) arr) ae,| \ 


Above we used repeatedly the Dominated Convergence theorem to establish 
continuity of integrals, see Theorem 5.8, p. 121, [235]. Also we used repeatedly 
Theorem 9-37, p. 219, [91], for differentiation under the integral sign. 


16. 
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